A Discriminative Matching Approach to Word Alignment

HLT/EMNLP(2005)

引用 258|浏览26
暂无评分
摘要
We present a discriminative, large-margin approach to feature-based matching for word alignment. In this framework, pairs of word tokens receive a matching score, which is based on features of that pair, including measures of association between the words, distortion between their positions, similarity of the orthographic form, and so on. Even with only 100 labeled training examples and simple features which incorporate counts from a large unlabeled corpus, we achieve AER performance close to IBM Model 4, in much less time. Including Model 4 predictions as features, we achieve a relative AER reduction of 22% in over intersected Model 4 alignments.
更多
查看译文
关键词
large-margin approach,relative aer reduction,feature-based matching,large unlabeled corpus,matching score,ibm model,word token,discriminative matching approach,aer performance close,intersected model,word alignment
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要