Joint Training and Decoding Using Virtual Nodes for Cascaded Segmentation and Tagging Tasks.

Xian Qian, Qi Zhang, Yaqian Zhou, Xuanjing Huang, Lide Wu

EMNLP '10: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (2010)

Abstract

Many sequence labeling tasks in NLP, such as Chinese POS tagging and named entity recognition, require solving a cascade of segmentation and tagging subtasks. Traditional pipeline approaches usually suffer from error propagation, while joint training and decoding in the cross-product state space can lead to too many parameters and high inference complexity. In this paper, we present a novel method that integrates the graph structures of the two subtasks into one using virtual nodes and performs joint training and decoding in the factorized state space. Experimental evaluations on the CoNLL 2000 shallow parsing data set and the Fourth SIGHAN Bakeoff CTB POS tagging data set demonstrate the superiority of our method over cross-product, pipeline, and candidate reranking approaches.
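
The parameter growth mentioned in the abstract can be illustrated with a rough count of first-order transition parameters. This is only a sketch under stated assumptions: the function name `transition_counts`, the label-set sizes, and the "factorized" count (here simply the sum of the two sub-task transition tables) are hypothetical and are not taken from the paper, whose virtual-node model may use a different parameterization.

```python
def transition_counts(num_seg_labels: int, num_tag_labels: int) -> dict:
    """Count first-order transition parameters for two ways of modelling a cascade."""
    # Cross-product: every (segmentation label, tag label) pair is one joint state,
    # so the transition table is quadratic in the product of the two label sets.
    joint_states = num_seg_labels * num_tag_labels
    cross_product = joint_states ** 2

    # Factorized (illustrative assumption, not the paper's exact model):
    # each sub-task keeps its own, much smaller, transition table.
    factorized = num_seg_labels ** 2 + num_tag_labels ** 2

    return {
        "joint_states": joint_states,
        "cross_product": cross_product,
        "factorized": factorized,
    }


# Hypothetical sizes: 4 segmentation labels (e.g. B/I/E/S) and 30 POS tags.
counts = transition_counts(4, 30)
print(counts)
# prints {'joint_states': 120, 'cross_product': 14400, 'factorized': 916}
```

With these assumed sizes the joint table is more than ten times larger than the two separate tables combined, and first-order Viterbi decoding over the joint states scales with the same quadratic term per position.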

Keywords

joint training, Chinese POS tagging, POS tagging data, tagging subtasks, cross-product state space, factorized state space, novel method, shallow parsing data, traditional pipeline, Fourth SIGHAN Bakeoff CTB, cascaded segmentation, tagging task, virtual node
