Research on Scientific Bibliography Clustering Technology Based on Citation Information Merging

2019 2nd International Conference on Artificial Intelligence and Big Data (ICAIBD) (2019)

Abstract
Cluster analysis is an important step in extracting and analyzing trends in the scientific and technical literature. Bibliographic records, which consist of the title, authors, keywords, and publication venue, are short texts that carry less information than long texts, whereas the bibliographic network also captures the citation relations among records. The experimental dataset consists of bibliographic records in the computer science field crawled from existing literature databases. Building on the traditional hierarchical clustering algorithm, this paper merges co-citation relations into the clustering. The experimental results show that the mean silhouette coefficient of the agglomerative clustering algorithm improves markedly when the co-citation relations among the records are merged in, which in turn improves the clustering of records with strong co-citation relations. Good clusters provide a solid basis for follow-up analysis of bibliography trends.
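The abstract does not say how the co-citation relations are merged into the clustering, so the following is only a minimal sketch under stated assumptions, not the paper's method. It assumes a weighted sum of bag-of-words cosine similarity and Jaccard co-citation overlap (the weight `alpha` is hypothetical), average linkage, whitespace tokenization, and a tiny invented toy dataset. It then compares the mean silhouette coefficient with and without the co-citation term.

```python
from collections import Counter
from itertools import combinations
from math import sqrt

def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def cocitation(citing, i, j):
    """Jaccard overlap of the sets of papers citing record i and record j (assumed measure)."""
    union = citing[i] | citing[j]
    return len(citing[i] & citing[j]) / len(union) if union else 0.0

def merged_distance(texts, citing, alpha=0.5):
    """Distance = 1 - (alpha * text similarity + (1 - alpha) * co-citation); alpha is hypothetical."""
    bags = [Counter(t.lower().split()) for t in texts]
    n = len(texts)
    d = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        sim = alpha * cosine(bags[i], bags[j]) + (1 - alpha) * cocitation(citing, i, j)
        d[i][j] = d[j][i] = 1.0 - sim
    return d

def agglomerative(d, k):
    """Average-linkage agglomerative clustering, merging until k clusters remain."""
    clusters = [[i] for i in range(len(d))]
    def link(a, b):
        return sum(d[i][j] for i in a for j in b) / (len(a) * len(b))
    while len(clusters) > k:
        a, b = min(combinations(range(len(clusters)), 2),
                   key=lambda p: link(clusters[p[0]], clusters[p[1]]))
        absorbed = clusters.pop(b)  # a < b, so index a is still valid after the pop
        clusters[a].extend(absorbed)
    labels = [0] * len(d)
    for c, members in enumerate(clusters):
        for i in members:
            labels[i] = c
    return labels

def mean_silhouette(d, labels):
    """Mean silhouette coefficient; singletons score 0. Assumes at least two clusters."""
    n = len(d)
    total = 0.0
    for i in range(n):
        own = [j for j in range(n) if labels[j] == labels[i] and j != i]
        if not own:
            continue
        a = sum(d[i][j] for j in own) / len(own)
        b = min(sum(d[i][j] for j in range(n) if labels[j] == c) / labels.count(c)
                for c in set(labels) if c != labels[i])
        total += (b - a) / (max(a, b) or 1.0)
    return total / n

# Toy data (invented for illustration): short texts and the sets of papers citing each record.
texts = ["deep learning image recognition",
         "convolutional networks for vision",
         "database query optimization",
         "index structures for query processing"]
citing = [{"p1", "p2"}, {"p1", "p2", "p3"}, {"p4"}, {"p4", "p5"}]

d_text = merged_distance(texts, citing, alpha=1.0)    # text similarity only
d_merged = merged_distance(texts, citing, alpha=0.5)  # text plus co-citation
labels_text = agglomerative(d_text, 2)
labels_merged = agglomerative(d_merged, 2)
sil_text = mean_silhouette(d_text, labels_text)
sil_merged = mean_silhouette(d_merged, labels_merged)
```

On this toy input the two vision records share no words, so text similarity alone cannot pair them, while their shared citing papers can. This illustrates why short bibliographic texts may benefit from the co-citation signal; it says nothing about the magnitude of the improvement reported in the paper.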
Keywords
natural language processing,clustering algorithm,scientific bibliography,citation relations