Training Stochastic Grammars From Unlabelled Text Corpora

msra(1992)

引用 30|浏览3
暂无评分
摘要
The paper describes various aspects and prac- ticalities of applying the "Hidden Markov" ap- proach to train parameters of regular and context- free stochastic grammars. The approach enables grammars to be trained from unlabelled text cor- pora, providing flexibility in the choice of syntac- tic categories and text domain. Part-of-speech tagging and parsing are discussed as applica- tions. Linguistic considerations can be used to de- velop constrained grammars, providing appropri- ate higher-order context for disamhiguation. Un- constrained grammars provide the opportunity to capture patterns that are not covered by a specific grammar. Experimental results are discussed for these alternatives.
更多
查看译文
关键词
higher order
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要