Concept-Enhanced Multi-view Clustering of Document Data

ISKE(2019)

引用 4|浏览13
暂无评分
摘要
Many works implemented multi-view clustering algorithms in document clustering. One challenging problem in document clustering is the similarity metric. Existing multi-view document clustering methods widely used two measurements: the Cosine similarity and the Euclidean Distance (ED). The first did not consider the magnitude between the two vectors. The second cannot compute the dissimilarity of two vectors that share the same ED. In this paper, we proposed a multi-view document clustering scheme to overcome these drawbacks by calculating the heterogeneity between documents with the same ED while taking into consideration their magnitudes. The experimental results show that the proposed similarity function can measure the similarity between documents more accurately than the existing metrics, and the proposed document clustering scheme goes beyond the limit of several state-of-the-art algorithms.
更多
查看译文
关键词
Document clustering,Similarity measurement,Multi-view clustering
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要