Extracting Topical Information Of Tweets Using Hashtags

2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)(2015)

引用 12|浏览3
暂无评分
摘要
Twitter is one of the largest micro blogging web sites where users share news, their opinions, moods, recommendations by posting text messages, and it is mostly used like a news media. Since the data being shared via Twitter is vast, many researchers are focusing on extracting meaningful information with the help of information retrieval systems. Retrieving meaningful information from social media applications became important for several tasks such as sentiment analysis, detecting anomalies, and recommendation systems. Topic modeling is one of the mostly studied and hard problems in information retrieval area, and it is even more challenging to model topics when the documents are too short such as tweets. In this paper, we focus on developing an effective and efficient method to overcome this challenge of tweets being too short for topic modeling. We compare different topic modeling schemes, one of which is not studied before, based on Latent Dirichlet Allocation (LDA) that merges tweets in order to improve LDA performance. We also demonstrate our experimental results with unbiased data collection and evaluation methodologies.
更多
查看译文
关键词
topical information extraction,tweets,hashtags,Twitter,microblogging Web sites,text messages posting,news media,information retrieval systems,social media applications,sentiment analysis,anomalies detection,recommendation systems,topic modeling,latent Dirichlet allocation,LDA,data collection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要