Multi-Label Emotion Classification for Imbalanced Chinese Corpus Based on CNN

2018 11th International Conference on Intelligent Computation Technology and Automation (ICICTA)(2018)

引用 3|浏览1
暂无评分
摘要
The paper uses NLP&CC2013 public data sets of emotion assessment tasks as the corpus. On account of these issues, such as the grammar of text is not standard, and the corpus is unbalanced distributing. Smote algorithm is put forward to over-sampling training data to get balanced corpus and CNN model is used to improve the sentence semantic differential. The optimal results of the paper improve 16.75% and 15.84% respectively compared with looseAP and strictAP of the optimal evaluation results in NLP&CC, increased by 11.15% and 11.55% respectively compared with the basic multi-label classifier ML-KNN. In addition, the paper also compares the performance of unbalanced and balanced training data based on CNN, confirming that the balanced training set can improve classification accuracy, at the same time through the change of the iterations, observe the semantic synthesis rule of CNN model.
更多
查看译文
关键词
emotion classification,multi-label classification,convolution neutral network,smote,word embedding
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要