Determining Word-Emotion Associations from Tweets by Multi-label Classification

2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)(2016)

引用 82|浏览36
暂无评分
摘要
The automatic detection of emotions in Twitter posts is a challenging task due to the informal nature of the language used in this platform. In this paper, we propose a methodology for expanding the NRC word-emotion association lexicon for the language used in Twitter. We perform this expansion using multi-label classification of words and compare different word-level features extracted from unlabelled tweets such as unigrams, Brown clusters, POS tags, and word2vec embeddings. The results show that the expanded lexicon achieves major improvements over the original lexicon when classifying tweets into emotional categories. In contrast to previous work, our methodology does not depend on tweets annotated with emotional hashtags, thus enabling the identification of emotional words from any domain-specific collection using unlabelled tweets.
更多
查看译文
关键词
word-emotion associations,tweet classification,multilabel word classification,automatic emotion detection,Twitter posts,NRC word-emotion association lexicon,word-level feature extraction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要