Information Retrieval using a Singular Value Decomposition Model of Latent Semantic Structure

George W. Furnas,Scott Deerwester, Susan T. Durnais,Thomas K. Landauer,Richard A. Harshman, Lynn A. Streeter, Karen E. Lochbaum

SIGIR(2017)

引用 664|浏览2
暂无评分
摘要
In a new method for automatic indexing and retrieval, implicit higher-order structure in the association of terms with documents is modeled to improve estimates of term-document association, and therefore the detection of relevant documents on the basis of terms found in queries. Singular-value decomposition is used to decompose a large term by document matrix into 50 to 150 orthogonal factors from which the original matrix can be approximated by linear combination; both documents and terms are represented as vectors in a 50- to 150- dimensional space. Queries are represented as pseudo-documents vectors formed from weighted combinations of terms, and documents are ordered by their similarity to the query. Initial tests find this automatic method very promising.
更多
查看译文
关键词
original matrix,document matrix,initial test,automatic method,latent semantic structure,implicit higher-order structure,singular value decomposition model,dimensional space,information retrieval,singular-value decomposition,automatic indexing,new method,term-document association,higher order,singular value decomposition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要