Abusive Language Detection In Online User Content

Chikashi Nobata,Joel Tetreault, Achint Thomas,Yashar Mehdad,Yi Chang

WWW(2016)

引用 1281|浏览603
暂无评分
摘要
Detection of abusive language in user generated online content has become an issue of increasing importance in recent years. Most current commercial methods make use of blacklists and regular expressions, however these measures fall short when contending with more subtle, less ham-fisted examples of hate speech. In this work, we develop a machine learning based method to detect hate speech on online user comments from two domains which outperforms a state-of-the-art deep learning approach. We also develop a corpus of user comments annotated for abusive language, the first of its kind. Finally, we use our detection tool to analyze abusive language over time and in different settings to further enhance our knowledge of this behavior.
更多
查看译文
关键词
NLP,Hate Speech,Abusive Language,Stylistic Classification,Discourse Classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要