Clustering Analysis Of Feature Words In News Text Based On Co-Occurrence Matrix

2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI)(2017)

引用 24|浏览15
暂无评分
摘要
In this paper, we use a new method to improve the context of a news. Especially, a new TF-IDF formula is used to calculate the weight of the feature word of News. We set a parameter W to the feature words, which takes into account the factors such as the parts of speech of feature words and inverse document frequency. The parameter is adjusted based on the K-core theory, and therefore to determine the range of feature words. Our work aims to analysis the co-occurrence strength of news feature words in a certain period of time, and obtains the cluster analysis of the co-occurrence intensity distance of news feature words. The simulation results of clustering analysis is significant and the quantitative research methods can be commonly used in bibliometrics in the news field to analyze the news content.
更多
查看译文
关键词
component, Co-occurrence matrix, clustering analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要