Unsupervised text pattern learning using minimum description length

Universal Communication Symposium(2010)

引用 1|浏览12
暂无评分
摘要
The knowledge of text patterns in a domain-specific corpus is valuable in many natural language processing (NLP) applications such as information extraction, question-answering system, and etc. In this paper, we propose a simple but effective probabilistic language model for modeling the in-decomposability of text patterns. Under the minimum description length (MDL) principle, an efficient unsupervised learning algorithm is implemented and the experiment on an English critical writing corpus has shown promising coverage of patterns compared with human summary.
更多
查看译文
关键词
computational linguistics,learning (artificial intelligence),english,minimum description length,text pattern,critical writing corpus,natural language processing,text analysis,probabilistic language model,unsupervised learning,probability,learning artificial intelligence,probabilistic logic,information extraction,language model,merging,dictionaries,question answering system
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要