Generating High-Quality Surface Realizations Using Data Augmentation and Factored Sequence Models.

MULTILINGUAL SURFACE REALISATION: SHARED TASK AND BEYOND(2018)

引用 1|浏览26
暂无评分
摘要
This work presents state of the art results in reconstruction of surface realizations from obfuscated text. We identify the lack of sufficient training data as the major obstacle to training high-performing models, and solve this issue by generating large amounts of synthetic training data. We also propose preprocessing techniques which make the structure contained in the input features more accessible to sequence models. Our models were ranked first on all evaluation metrics in the English portion of the 2018 Surface Realization shared task.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要