Generating Adversarial Examples for Sentiment Classifier of Chinese Sentences

2020 6th International Symposium on System and Software Reliability (ISSSR)(2020)

引用 1|浏览5
暂无评分
摘要
Studies have shown that deep learning models are vulnerable to adversarial examples, which cause incorrect predictions by adding imperceptible perturbations into normal inputs. The same characteristic goes for the sentiment orientation classification models. If the input text for the models contains perturbing information such as typos or special symbols, the inputs could misled the models. The adversarial examples reflect the diversity of text features, and the flaws of the sentiment orientation classification models can be found by adversarial attacks. The adversarial examples can be used to train the sentiment orientation classification models to improve the robustness of the model. This paper proposes to generate adversarial examples of Chinese sentences by replacing one of the characters in the word with similar Chinese characters in a black-box manner. The experimental results show that the adversarial examples generated by this method cost less and visually closer to the normal text.
更多
查看译文
关键词
text classification,adversarial examples,robustness,sentiment tendency
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要