Using an Unsupervised Neural Network to Detect and Categorize Offensive Language in Social Media

Emil Stefan Chifu,Viorica Rozina Chifu, Ana-Maria Costea

IFMBE Proceedings7th International Conference on Advancements of Medicine and Health Care through Technology(2022)

引用 0|浏览1
暂无评分
摘要
The offensive language in present social media could harm the mental health of the minors. The paper aims to identify the offensive content in the posts published on the Twitter social network. More concrete, we categorize the text content of the tweets according to the characteristics they meet: offensive vs. non-offensive tweets, non-targeted tweets vs. tweets targeted on someone, and, more specifically, tweets targeted on an individual, on a group of people, or else targeted towards another category (i.e. towards an organization, a situation, an event, or an issue). This multi-level hierarchical categorization behaves like a top-down decision tree process that classifies the tweets against a tree like ontological taxonomy. This is an offensiveness taxonomy, which defines the above mentioned offensive tweet categories. We use an unsupervised neural network for this hierarchical categorization, as applied on the OLID (Offensive Language Identification Dataset) data set. The OLID dataset consists of tweets, and it was offered as benchmark at Task 6 of the SemEval 2019 competition, a task which actually inspired our paper.
更多
查看译文
关键词
categorize offensive language,unsupervised neural network,neural network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要