The Feature Selection Based On Crfs Model For Chinese Named Entity Recognition In Micro-Blog

INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMMUNICATION ENGINEERING (CSCE 2015)(2015)

引用 0|浏览0
暂无评分
摘要
Utilizing Natural Language Processing (NLP) for recognizing named entity from micro-blog was a new challenging, emerging research area. However, micro-blog texts lacked of good qualities, and instead a colloquial and mixed language, which bring great difficulties for NER on this kind of material. In this paper, we combined with the characteristics of Chinese character and micro-blog text; comparatively explored the effects of features selection on Conditional Random Fields (CRFs) model. Two categories of features were involved in our system: inner feature and external feature. The experiment results showed that appropriate features could effectively improve F-scores. Especially, the dictionary feature and POS feature made a bigger difference for NER performance.
更多
查看译文
关键词
NER, Micro-blog, CRFs, feature selection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要