Improving similarity measures for short segments of text

AAAI(2007)

引用 173|浏览99
暂无评分
摘要
In this paper we improve previous work on measuring the similarity of short segments of text in two ways. First, we introduce a Web-relevance similarity measure and demonstrate its effectiveness. This measure extends the Web-kernel similarity function introduced by Sahami and Heilman (2006) by using relevance weighted inner-product of term occurrences rather than TF×IDF. Second, we show that one can further improve the accuracy of similarity measures by using a machine learning approach. Our methods outperform other state-of-the-art methods in a general query suggestion task for multiple evaluation metrics.
更多
查看译文
关键词
multiple evaluation metrics,term occurrence,web-relevance similarity measure,web-kernel similarity,relevance weighted inner-product,similarity measure,improving similarity measure,previous work,general query suggestion task,short segment,state-of-the-art method,inner product,machine learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要