
SCOTS: Scottish Corpus Of Texts and Speech
The Scottish Corpora project has created large electronic corpora of written and spoken texts for the languages of Scotland, featuring nearly 4.6 million words of text, with audio recordings to accompany many of the spoken texts.
Recent years have brought significant changes to the political situation in Scotland. This new political situation has been accompanied by a resurgence of interest in the languages and culture of Scotland.
Advances in computer technology have made it possible to store and analyse large quantities of information in ways which were previously unthinkable. As a result, in recent years much research in the Humanities has focused on the building of large text archives and corpora. Such resources offer exciting opportunities to study language on a broad scale and with a precision which would otherwise be impossible.
The Scottish Corpora project has created large electronic corpora of written and spoken texts for the languages of Scotland. The Scottish Corpus of Texts & Speech (SCOTS) has been online since November 2004, and, after a number of updates and additions, has reached a total of nearly 4.6 million words of text, with audio recordings to accompany many of the spoken texts.
The corpus allow those interested in Scotland’s linguistic diversity, and in Scottish culture and identity, to investigate the languages of Scotland in new ways, and to address the gap which presently exists in our knowledge of these.
Main contact: Wendy Anderson
Developers: Brian AitkenDavid Beavan
Start year: 2002
End year: 2004
Funded by: AHRC
Subject area: English Language & Linguistics
Keywords: Corpus LinguisticsMappingScotsVisualisation
Record last updated 2020-01-13