Simple Data Augmentation for Multilingual NLU in Task Oriented Dialogue Systems.
CLiC-it(2020)
摘要
Data augmentation has shown potential in alleviating data scarcity for Natural Language Understanding (e.g. slot filling and intent classification) in task-oriented dialogue systems. As prior work has been mostly experimented on English datasets, we focus on five different languages, and consider a setting where limited data are available. We investigate the effectiveness of non-gradient based augmentation methods, involving simple text span substitutions and syntactic manipulations. Our experiments show that (i) augmentation is effective in all cases, particularly for slot filling; and (ii) it is beneficial for a joint intent-slot model based on multilingual BERT, both for limited data settings and when full training data is used.
更多查看译文
关键词
task oriented dialogue systems,multilingual nlu
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络