Topic and Trend Detection in Text Collections Using Latent Dirichlet Allocation

ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS(2009)

Abstract
Algorithms that automatically mine distinct topics from document collections have become increasingly important, both because of their applications in many fields and because of the rapid growth in the number of documents across domains. In this paper, we propose a generative model based on latent Dirichlet allocation (LDA) that integrates the temporal ordering of documents into the generative process in an iterative fashion. The document collection is divided into time segments, and the topics discovered in each segment are propagated to influence topic discovery in subsequent segments. Our experimental results on a collection of academic papers from the CiteSeer repository show that the segmented topic model can effectively detect distinct topics and their evolution over time.
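The abstract does not say how topics are propagated between segments, so the following is only a minimal sketch of one plausible reading: fit LDA on each time segment with collapsed Gibbs sampling, then scale the fitted topic-word distributions into the Dirichlet prior for the next segment. The function names `gibbs_lda` and `segmented_lda`, the `carry` strength parameter, and the toy data are all hypothetical and are not taken from the paper.

```python
import random

def gibbs_lda(docs, vocab_size, num_topics, beta, alpha=0.1, iters=50, seed=0):
    """Collapsed Gibbs sampler for LDA with a per-topic, per-word prior beta[k][w].

    docs is a list of documents, each a list of integer word ids.
    Returns the posterior-mean topic-word distributions (one row per topic).
    """
    rng = random.Random(seed)
    doc_topic = [[0] * num_topics for _ in docs]
    topic_word = [[0] * vocab_size for _ in range(num_topics)]
    topic_total = [0] * num_topics
    beta_sum = [sum(row) for row in beta]

    # Random initial topic assignment for every token.
    assign = []
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            k = rng.randrange(num_topics)
            zs.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assign.append(zs)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = assign[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1
                # Resample its topic from the collapsed conditional.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta[t][w])
                    / (topic_total[t] + beta_sum[t])
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assign[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1

    return [
        [(topic_word[k][w] + beta[k][w]) / (topic_total[k] + beta_sum[k])
         for w in range(vocab_size)]
        for k in range(num_topics)
    ]

def segmented_lda(segments, vocab_size, num_topics, base_beta=0.01, carry=50.0, **kw):
    """Fit LDA per time segment, passing topics forward through the prior.

    ASSUMPTION: propagation is modeled as beta_next = base_beta + carry * phi_prev.
    The paper's actual propagation rule may differ.
    """
    beta = [[base_beta] * vocab_size for _ in range(num_topics)]
    history = []
    for docs in segments:
        phi = gibbs_lda(docs, vocab_size, num_topics, beta, **kw)
        history.append(phi)
        # Topics found in this segment become the prior for the next one.
        beta = [[base_beta + carry * p for p in row] for row in phi]
    return history

# Toy corpus (hypothetical): two time segments, vocabulary of 5 word ids.
toy_segments = [
    [[0, 1, 0, 1], [2, 3, 2, 3]],
    [[0, 1, 1, 0], [3, 2, 3, 4]],
]
toy_history = segmented_lda(toy_segments, vocab_size=5, num_topics=2, iters=20)
```

Because topic k in one segment seeds the prior of topic k in the next, topic indices stay aligned across segments, so comparing `toy_history[t][k]` over `t` gives a rough view of how a topic's word distribution drifts over time. A larger `carry` ties each segment more tightly to its predecessor, while `carry=0` reduces to independent LDA runs per segment.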
Keywords
subsequent time segment, CiteSeer repository, distinct topic, latent Dirichlet allocation, segmented topic model, generative process, academic paper, document collection, generative model, trend detection, topic discovery, time segment