The Parsed Corpus of Scottish Correspondence

Dataset

Description

This resource is available for download in Kielipankki – the Language Bank of Finland.
The Parsed Corpus of Scottish Correspondence (PCSC; 1540-1750) is a corpus of letters written by Scottish writers which has been syntactically annotated in the Penn Parsed Corpora of Historical English (PPCHE) format. This places the PCSC within a family of sister corpora which use the PPCHE annotation conventions, e.g., corpora of historical English, Yiddish, French, etc., and makes the PCSC the first Scots corpus which can be easily used for comparative study within this family of corpora. The PCSC consists of data from the Helsinki Corpus of Scottish Correspondence, compiled by Annelie Meurman-Solin and the VARIENG group, and the metadata collected by the original compilers (e.g., on writer and addressee gender, writer origin, letter location, script type, etc.) makes this a suitable resource not only for studies on syntactic variation and change, but also on social factors affecting variation and change.
The original resource was produced by Anneli Meurman-Solin in 2017 and syntactically parsed and annotated by Lisa Gotthard in 2024.
When using this parsed version, please remember to include a separate reference to the original corpus version as required. Other versions of the original resource were previously made available in the Language Bank of Finland via the Korp service (see https://urn.fi/urn:nbn:fi:lb-201411071) and as a downloadable VRT version (see http://urn.fi/urn:nbn:fi:lb-202103291).
Date made available14 Jan 2025
PublisherFIN-CLARIN
Date of data productionDec 2018 - Jul 2024

Cite this