Fast, greedy model minimization for unsupervised tagging

COLING(2010)

引用 32|浏览53
暂无评分
摘要
Model minimization has been shown to work well for the task of unsupervised part-of-speech tagging with a dictionary. In (Ravi and Knight, 2009), the authors invoke an integer programming (IP) solver to do model minimization. However, solving this problem exactly using an integer programming formulation is intractable for practical purposes. We propose a novel two-stage greedy approximation scheme to replace the IP. Our method runs fast, while yielding highly accurate tagging results. We also compare our method against standard EM training, and show that we consistently obtain better tagging accuracies on test data of varying sizes for English and Italian.
更多
查看译文
关键词
standard em training,integer programming formulation,integer programming,unsupervised part-of-speech,test data,practical purpose,unsupervised tagging,greedy model minimization,novel two-stage greedy approximation,tagging accuracy,accurate tagging result,model minimization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要