Research on Chinese Text Automatic Categorization Based on VSM

2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15(2007)

引用 3|浏览1
暂无评分
摘要
Automatic text classifying is an import application of the information processing technology. This paper introduces the key techniques of Chinese text categorization such as text preprocessing, feature selection, feature representation, training and classifying algorithm, especially analyses the current most important several feature selection methods with emphasis. A Chinese text classifier based on KNN algorithm was developed. The system can preferably implement Chinese automatic text categorization and has a higher quality. We also use this classifier to compare several feature selection methods. In the end, we utilize the experiment results to prove the importance role of feature selection in text categorization.
更多
查看译文
关键词
text categorization, vector space model, KNN algorithm
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要