TopicView: Visually Comparing Topic Models of Text Collections

Tools with Artificial Intelligence (2011)

Abstract
We present TopicView, an application for visually comparing and exploring multiple models of text corpora. TopicView uses multiple linked views to visually analyze both the conceptual content and the document relationships in models generated using different algorithms. To illustrate TopicView, we apply it to models created using two standard approaches: Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA). Conceptual content is compared through the combination of (i) a bipartite graph matching LSA concepts with LDA topics based on the cosine similarities of model factors and (ii) a table containing the terms for each LSA concept and LDA topic, listed in decreasing order of importance. Document relationships are examined through the combination of (i) side-by-side document similarity graphs, (ii) a table listing the weights for each document's contribution to each concept/topic, and (iii) a full-text reader for documents selected in either of the graphs or the table. We demonstrate the utility of TopicView's visual approach to model assessment by comparing LSA and LDA models of two example corpora.
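The bipartite matching of LSA concepts to LDA topics described in the abstract can be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the function names, the toy factor matrices, the edge threshold of 0.5, and the use of the absolute cosine value (assumed here because the sign of an LSA factor is arbitrary) are all assumptions introduced for this example.

```python
import numpy as np

def cosine_similarity_matrix(lsa_factors, lda_factors):
    """Return a matrix whose (i, j) entry is the cosine similarity between
    LSA concept i and LDA topic j. Each input row is one factor expressed
    over the same term vocabulary (an assumption of this sketch)."""
    a = np.asarray(lsa_factors, dtype=float)
    b = np.asarray(lda_factors, dtype=float)
    a_unit = a / np.linalg.norm(a, axis=1, keepdims=True)
    b_unit = b / np.linalg.norm(b, axis=1, keepdims=True)
    return a_unit @ b_unit.T

def bipartite_edges(similarity, threshold=0.5):
    """List (concept, topic, weight) edges whose absolute cosine similarity
    meets the threshold. The threshold value is hypothetical."""
    rows, cols = np.nonzero(np.abs(similarity) >= threshold)
    return [(int(i), int(j), float(similarity[i, j])) for i, j in zip(rows, cols)]

# Toy factors over a three-term vocabulary (made-up numbers, not real data).
lsa = np.array([[1.0, 0.0, 0.0],
                [0.0, 1.0, 1.0]])
lda = np.array([[0.9, 0.1, 0.0],
                [0.0, 0.5, 0.5]])

sim = cosine_similarity_matrix(lsa, lda)
edges = bipartite_edges(sim, threshold=0.5)
for concept, topic, weight in edges:
    print(f"LSA concept {concept} <-> LDA topic {topic}: {weight:.3f}")
```

In this toy case each LSA concept connects to exactly one LDA topic; with real models a concept may link to several topics or to none, which is the kind of disagreement the bipartite graph view is meant to expose.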
Keywords
lda topic, text collections, model assessment, latent dirichlet allocation, topic models, lda model, conceptual content, topic view, lsa concept, full text reader, latent semantic analysis, document relationship, data models, computer model, layout, visualization, computational modeling, bipartite graph, text analysis, vectors, data visualisation, data model, content management, graph theory