Efficient Tree-Based Topic Modeling.

Yuening Hu, Jordan L. Boyd-Graber

ACL '12: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2(2012)

引用 26|浏览44
暂无评分
摘要
Topic modeling with a tree-based prior has been used for a variety of applications because it can encode correlations between words that traditional topic modeling cannot. However, its expressive power comes at the cost of more complicated inference. We extend the SPARSELDA (Yao et al., 2009) inference scheme for latent Dirichlet allocation (LDA) to tree-based topic models. This sampling scheme computes the exact conditional distribution for Gibbs sampling much more quickly than enumerating all possible latent variable assignments. We further improve performance by iteratively refining the sampling distribution only when needed. Experiments show that the proposed techniques dramatically improve the computation time.
更多
查看译文
关键词
sampling distribution,sampling scheme,topic modeling,traditional topic,tree-based topic model,complicated inference,exact conditional distribution,inference scheme,latent Dirichlet allocation,possible latent variable assignment,efficient tree-based topic modeling
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要