Joint Training and Decoding Using Virtual Nodes for Cascaded Segmentation and Tagging Tasks.

EMNLP '10: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing(2010)

引用 6|浏览85
暂无评分
摘要
Many sequence labeling tasks in NLP require solving a cascade of segmentation and tagging subtasks, such as Chinese POS tagging, named entity recognition, and so on. Traditional pipeline approaches usually suffer from error propagation. Joint training/decoding in the cross-product state space could cause too many parameters and high inference complexity. In this paper, we present a novel method which integrates graph structures of two sub-tasks into one using virtual nodes, and performs joint training and decoding in the factorized state space. Experimental evaluations on CoNLL 2000 shallow parsing data set and Fourth SIGHAN Bakeoff CTB POS tagging data set demonstrate the superiority of our method over cross-product, pipeline and candidate reranking approaches.
更多
查看译文
关键词
joint training,Chinese POS tagging,POS tagging data,tagging subtasks,cross-product state space,factorized state space,novel method,shallow parsing data,traditional pipeline,Fourth SIGHAN Bakeoff CTB,cascaded segmentation,tagging task,virtual node
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要