Improving Retrieval by a Similarity Thesaurus based on Hyperlink Structure

Databases and Applications(2005)

引用 24|浏览5
暂无评分
摘要
One strategy to enhance the retrieval effectiveness of search engines is to apply automatic query expansion. For this purpose a similarity thesaurus may be applied in order to find new search terms. The similarity thesaurus may be constructed using a model for term comparison. Common methods to define term distances are based on the occur- rence frequencies of terms in documents. In this article we develop a new measure for term distances that is based on the hyperlink structure connecting documents. Hyperlinks frequently point to documents that concern similar topics. Based on this assumption in the presented system term dis- tances between terms in linked documents are decreased. We apply a search engine based on automatic query ex- pansion to evaluate this approach. In the experiments sim- ulated hyperlink graphs are applied to show the effect of different hyperlink topologies on the retrieval quality.
更多
查看译文
关键词
web structure min- ing,hyperlink-based information retrieval,search engine,information retrieval
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要