An Algorithm To Identify Periods Of Establishment And Obsolescence Of Linguistic Items In A Diachronic Corpus

Corpora (2021)

Abstract
When exploring diachronic corpora, linguists often benefit from pinpointing not only the first or last attestation dates of linguistic items, but also the moments at which those items become firmly established in the corpus or, conversely, become obsolete despite still being part of the language. In this paper, we propose an algorithm that helps identify such periods based on the frequency of items in a corpus. The algorithm is simple and generalisable: it can be applied to any linguistic item in any corpus that is divided into timeframes. We demonstrate its applicability using lexical data from the Corpus of Historical American English (COHA), with case studies on the statistics and characteristics of words that appear in or disappear from this corpus in different periods.
Keywords
COHA, diachronic corpus linguistics, English, lexical change, neologism, obsolete word
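
The abstract does not spell out the algorithm itself, so the following is only an illustrative sketch of one way a frequency-based criterion over timeframes might look, not the authors' method. It assumes an item counts as established from the first timeframe that opens a run of consecutive timeframes at or above a frequency threshold, and as obsolete from the start of a final stretch below that threshold lasting to the end of the corpus. The function names, the `threshold` and `run` parameters, and the toy frequencies are all hypothetical and are not taken from the paper or from COHA.

```python
from typing import Optional, Sequence


def establishment_index(
    freqs: Sequence[float], threshold: float, run: int = 2
) -> Optional[int]:
    """Return the index of the first timeframe that opens `run` consecutive
    timeframes with frequency >= threshold, or None if no such run exists.
    (Hypothetical criterion, chosen for illustration only.)"""
    streak = 0
    for i, freq in enumerate(freqs):
        streak = streak + 1 if freq >= threshold else 0
        if streak == run:
            return i - run + 1
    return None


def obsolescence_index(
    freqs: Sequence[float], threshold: float, run: int = 2
) -> Optional[int]:
    """Return the index where the final below-threshold stretch begins,
    provided that stretch reaches the end of the corpus, spans at least
    `run` timeframes, and follows at least one timeframe at or above the
    threshold. Return None otherwise. (Hypothetical criterion.)"""
    start = len(freqs)
    # Walk backwards over the trailing timeframes that fall below the threshold.
    while start > 0 and freqs[start - 1] < threshold:
        start -= 1
    if start == 0 or len(freqs) - start < run:
        return None
    return start


# Toy per-timeframe relative frequencies (invented data, one value per decade).
decades = [1900, 1910, 1920, 1930, 1940, 1950, 1960, 1970, 1980]
toy_freqs = [0.0, 0.2, 1.5, 2.0, 3.1, 2.4, 0.4, 0.3, 0.1]

est = establishment_index(toy_freqs, threshold=1.0)
obs = obsolescence_index(toy_freqs, threshold=1.0)
print("established from:", None if est is None else decades[est])
print("obsolete from:", None if obs is None else decades[obs])
```

Under these assumptions the item can still be attested (non-zero frequency) after the obsolescence point, which mirrors the abstract's distinction between last attestation and obsolescence. The threshold and run length would need to be chosen per corpus.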