Ignorance is Bliss: Exploring Defenses Against Invariance-Based Attacks on Neural Machine Translation Systems

IEEE Transactions on Artificial Intelligence(2022)

引用 1|浏览19
暂无评分
摘要
This article addresses an invariance-based attack on the transformer, a state-of-the-art neural machine translation (NMT) system. Such attacks make multiple changes to the source sentence with the goal of keeping the predicted translation unchanged. Since the gold translation is not available for the adversarial sentences, tackling invariance-based attacks is a challenging task. We propose two contrasting defense strategies for the same, learn to deal and learn to ignore . In learn to deal , NMT system is trained not to predict the same translation for a clean text and its noisy counterpart, whereas in learn to ignore , NMT system is trained to output a dummy sentence in the target language whenever it encounters a noisy text. The experiments on two language pairs, English–German (en–de) and English–French (en–fr), show that learn to deal strategy reduces the attack success rate from 84.0% to 62.2% for en–de and from 84.6% to 73.8% for en–fr, whereas learn to ignore strategy reduces the attack success rate from 84.0% to 27.2% for en–de and from 84.6% to 37.0% for en–fr.
更多
查看译文
关键词
Adversarial robustness,deep learning,neural machine translation (NMT)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要