Three Dependency-and-Boundary Models for Grammar Induction.

Empirical Methods in Natural Language Processing(2012)

引用 15|浏览70
暂无评分
摘要
We present a new family of models for unsupervised parsing, Dependency and Boundary models, that use cues at constituent boundaries to inform head-outward dependency tree generation. We build on three intuitions that are explicit in phrase-structure grammars but only implicit in standard dependency formulations: (i) Distributions of words that occur at sentence boundaries --- such as English determiners --- resemble constituent edges. (ii) Punctuation at sentence boundaries further helps distinguish full sentences from fragments like headlines and titles, allowing us to model grammatical differences between complete and incomplete sentences. (iii) Sentence-internal punctuation boundaries help with longer-distance dependencies, since punctuation correlates with constituent edges. Our models induce state-of-the-art dependency grammars for many languages without special knowledge of optimal input sentence lengths or biased, manually-tuned initializers.
更多
查看译文
关键词
constituent edge,sentence boundary,head-outward dependency tree generation,longer-distance dependency,standard dependency formulation,state-of-the-art dependency grammar,Sentence-internal punctuation boundary,constituent boundary,full sentence,incomplete sentence,dependency-and-boundary model,grammar induction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要