Multilingual aligned parallel treebank corpus reflecting contextual information and its applications

MLR '04: Proceedings of the Workshop on Multilingual Linguistic Ressources (2004)

Abstract
This paper describes Japanese-English-Chinese aligned parallel treebank corpora of newspaper articles. They have been constructed by translating each sentence in the Penn Treebank and the Kyoto University text corpus into a corresponding natural sentence in a target language. Each sentence is translated so as to reflect its contextual information and is annotated with morphological and syntactic structures and phrasal alignment. This paper also describes the possible applications of the parallel corpus and proposes a new framework to aid in translation. In this framework, parallel translations whose source language sentence is similar to a given sentence can be semi-automatically generated. In this paper we show that the framework can be achieved by using our aligned parallel treebank corpus.
Keywords
parallel treebank corpus, corresponding natural sentence, source language sentence, parallel corpus, parallel translation, Kyoto University text corpus, new framework, target language, Penn Treebank, contextual information