The TransBank Aligner: Cross-Sentence Alignment with Deep Neural Networks

Text, Speech, and Dialogue (TSD 2019)

Abstract
Sentence-aligned parallel bilingual corpora are the main, and sometimes the only, resource required for training Statistical and Neural Machine Translation systems. We propose an end-to-end deep neural architecture for sentence alignment. In addition to one-to-one alignment, our aligner can also perform cross alignment. We used three language pairs from the Europarl corpus and an English-Persian corpus to generate an alignment dataset. Using this dataset, we tested our system both in isolation and within an SMT system. In both settings, we obtained significantly better results than two competitive baselines.
Keywords
Sentence alignment, Parallel corpora, Deep neural network