Improving text simplification by corpus expansion with unsupervised learning

2019 International Conference on Asian Language Processing (IALP)(2019)

引用 1|浏览3
暂无评分
摘要
Automatic sentence simplification aims to reduce the complexity of vocabulary and expressions in a sentence while retaining its original meaning. We constructed a simplification model that does not require a parallel corpus using an unsupervised translation model. In order to learn simplification by unsupervised manner, we show that pseudo-corpus is constructed from the web corpus and that the corpus expansion contributes to output more simplified sentences. In addition, we confirm that it is possible to learn the operation of simplification by preparing large-scale pseudo data even if there is non-parallel corpus for simplification.
更多
查看译文
关键词
unsupervised machine translation,Japanese simplification,corpus expansion
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要