News Topic Discovery Through Community Detection

2019 IEEE International Conference on Signal, Information and Data Processing (ICSIDP), 2019

Abstract
With the rapid development of communication technology and the internet, a huge number of news items are published every day. Because of the way news spreads, many of these items concentrate on a single topic concerning the same event or person, so news topic discovery has become an important and urgent task in text mining. The most frequently used model for news topic discovery is Latent Dirichlet Allocation (LDA), which treats each document as generated from a finite mixture of $K$ possible topics. In practical applications, however, the performance of LDA is not satisfactory. In this paper, we try to solve this problem through text structure mining. Our proposed method consists of two steps. The first step finds the topics as clusters, or communities, of news items by means of community detection. The second step uses a Bayesian unigram model to obtain the topic tokens for each topic. Experimental results on a real-world news dataset demonstrate that the proposed method finds topics much better than LDA.
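The two steps described in the abstract can be illustrated with a minimal sketch. The abstract does not say which community detection algorithm, similarity measure, or prior the authors use, so everything below is an assumption made for illustration: a cosine-similarity graph over bag-of-words vectors with a hypothetical threshold of 0.3, a simple deterministic label propagation as the community detection step, and a unigram model with a symmetric Dirichlet prior (parameter `beta`) whose posterior mean ranks the topic tokens. All function names are hypothetical.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Naive whitespace tokenizer; a real pipeline would also remove stop words.
    return text.lower().split()


def cosine(a, b):
    # Cosine similarity between two bag-of-words Counters.
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def build_graph(bags, threshold=0.3):
    # Connect two news items when their similarity reaches the (assumed) threshold.
    graph = defaultdict(dict)
    for i in range(len(bags)):
        for j in range(i + 1, len(bags)):
            sim = cosine(bags[i], bags[j])
            if sim >= threshold:
                graph[i][j] = sim
                graph[j][i] = sim
    return graph


def label_propagation(graph, n, max_iter=20):
    # Step 1 (assumed algorithm): each node repeatedly adopts the label with the
    # largest total edge weight among its neighbours; ties go to the smallest label.
    labels = list(range(n))
    for _ in range(max_iter):
        changed = False
        for node in range(n):
            scores = Counter()
            for neighbour, weight in graph.get(node, {}).items():
                scores[labels[neighbour]] += weight
            if not scores:
                continue  # isolated item keeps its own label
            best = min(scores, key=lambda lab: (-scores[lab], lab))
            if best != labels[node]:
                labels[node] = best
                changed = True
        if not changed:
            break
    groups = defaultdict(list)
    for node, lab in enumerate(labels):
        groups[lab].append(node)
    return sorted(groups.values())


def topic_tokens(bags, members, vocab_size, beta=0.1, top_k=3):
    # Step 2 (assumed form): posterior-mean word probabilities of a unigram model
    # with a symmetric Dirichlet(beta) prior, estimated from one community.
    counts = Counter()
    for i in members:
        counts.update(bags[i])
    total = sum(counts.values())
    probs = {w: (c + beta) / (total + beta * vocab_size) for w, c in counts.items()}
    return sorted(probs, key=lambda w: (-probs[w], w))[:top_k]


def discover_topics(docs, threshold=0.3, top_k=3):
    bags = [Counter(tokenize(d)) for d in docs]
    vocab_size = len(set().union(*bags))
    communities = label_propagation(build_graph(bags, threshold), len(bags))
    return [(m, topic_tokens(bags, m, vocab_size, top_k=top_k)) for m in communities]
```

Calling `discover_topics` on a list of news strings returns one `(member_indices, top_tokens)` pair per detected community. Unlike LDA, the number of topics is not fixed in advance in this sketch; it follows from the structure of the similarity graph.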
Keywords
topic discovery, LDA, text mining, community detection, network