Implementation Of An Information Retrieval System Using The Soft Cosine Measure

NATURE-INSPIRED DESIGN OF HYBRID INTELLIGENT SYSTEMS(2017)

引用 2|浏览8
暂无评分
摘要
The retrieval information models have been of important study since 1992. These models are based on comparing a user query and a collection of documents taking into account the concurrency of the terms, with the objective to classify a set of relevant documents and retrieve them to the user in accordance with the evaluations criterion. There are metrics to classify a set of documents according to the grade of similarity, such as cosine similarity and soft cosine measure. In this paper, we perform a comparative study of these similarity metrics. The Vector Space Model (VSM) was implemented for retrieving information. A sample of the Collection of the Association for Computing Machinery (CACM) in the domain of Computer Science was used in the evaluation. The experiment results show that the recall is of 96 % in both metrics, but the soft cosine achieves 2 % more in mean average precision.
更多
查看译文
关键词
Vector space model, Similarity cosine, Soft cosine measure, CACM, Recall, Precision
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要