Protecting marginalized communities by mitigating discrimination in toxic language detection
2021 IEEE International Symposium on Technology and Society (ISTAS)(2021)
摘要
As the harms of online toxic language become more apparent, countering online toxic behavior is an essential application of natural language processing. The first step in managing toxic language risk is identification, but algorithmic approaches have themselves demonstrated bias. Texts containing some demographic identity terms such as gay or Black are more likely to be labeled as toxic in existin...
更多查看译文
关键词
Training,Deep learning,Toxicology,Machine learning algorithms,Bit error rate,Predictive models,Natural language processing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要