Learning Domain-Specific Word Embeddings from COVID-19 Tweets.

Steve Aibuedefe Aigbe,Christoph Eick

IEEE BigData(2021)

引用 2|浏览5
暂无评分
摘要
The COVID-19 global pandemic has been a major catastrophic event that impacted the world's economy. During the pandemic there was a rise in the use of social media such as Twitter by people to express their reactions and responses to the global pandemic. This drove researchers to analyze these micro-blogging texts, using natural language processing (NLP) methods, to understand information inherent in those texts. Most of these NLP tasks employ the use of word embeddings in training neural network models. These word embeddings are mainly trained on general text corpus which produce sub-optimal performance when used in domain-specific NLP tasks such as in COVID-19 related tweets. In this paper, we present a learned COVID-19 tweets domain-specific word embeddings for use in COVID-19 related tweets NLP tasks. Our evaluation results show that our domain-specific COVID-19 tweets word embeddings perform better than pretrained general word embeddings in a downstream domain-specific NLP task. Our COVID-19 tweets word embeddings are available for use by researchers who wish to perform downstream NLP tasks with pretrained domain-specific COVID-19 tweets word embeddings.
更多
查看译文
关键词
Domain-Specific,Word Embeddings,COVID-19,Tweets
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要