Tuning Graph2vec with Node Labels for Abuse Detection in Online Conversations (extended abstract).

MARAMI(2020)

Abstract
In recent years, online social media have allowed people to meet and hold discussions worldwide. These popular platforms are confronted with a growing amount of abusive content. To automate the detection of such content, researchers have proposed various methods based on Natural Language Processing (NLP), and have also leveraged behavioral information about users and the structure of conversations. In our previous work, we proposed combining NLP and conversational graph-based features to detect abusive messages in chat logs extracted from an online game. These conversational graphs model interactions between users (i.e. who is arguing with whom?), while completely ignoring the language content of the messages. We characterized the structure of these graphs by computing a large set of manually selected topological measures, and used them as features to train a classifier to detect abusive messages. Graph embedding methods represent graphs as low-dimensional vectors while preserving at least part of their topological properties. In addition to the plain structure, certain methods can capture further information such as node labels or the weight and direction of edges. Because these representations are learned automatically, they require no feature selection or feature engineering. One can distinguish four main categories of graph embedding methods, depending on the nature of the objects considered: node, edge, subgraph and whole-graph embeddings. Each category suits different applications and problems. In this paper, we focus on the information that some embedding approaches use in addition to the plain structure. In particular, we study the impact of the node labels used by Graph2vec, a whole-graph embedding method, and assess the effectiveness of this additional information in the context of online abuse detection.
Keywords
abuse detection, graph2vec, online conversations, node labels
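
To illustrate why node labels can matter for Graph2vec, recall that the method is commonly described as treating each graph as a document whose "words" are rooted subgraphs obtained by Weisfeiler-Lehman relabeling, before learning a doc2vec-style embedding. The sketch below covers only that relabeling step, in plain Python. The function name `wl_subtree_features`, the toy conversation, and the "target"/"other" labelling are hypothetical choices made for illustration and are not taken from the paper.

```python
from collections import Counter


def wl_subtree_features(adjacency, labels, iterations=2):
    """Count the Weisfeiler-Lehman subtree labels found in one graph.

    adjacency: dict mapping each node to a list of its neighbours.
    labels: dict mapping each node to its initial label.
    """
    # Iteration 0: the features are simply the initial node labels.
    current = {node: str(labels[node]) for node in adjacency}
    features = Counter(current.values())
    for _ in range(iterations):
        updated = {}
        for node, neighbours in adjacency.items():
            # A node's new label combines its own label with the sorted
            # labels of its neighbours, so it describes a deeper subtree.
            neighbour_labels = sorted(current[n] for n in neighbours)
            updated[node] = current[node] + "(" + ",".join(neighbour_labels) + ")"
        current = updated
        features.update(current.values())
    return features


# Hypothetical toy conversation: user "a" interacts with users "b" and "c".
conversation = {"a": ["b", "c"], "b": ["a"], "c": ["a"]}

# Variant 1: no label information, every node carries the same label.
uniform = wl_subtree_features(
    conversation, {n: "x" for n in conversation}, iterations=1
)

# Variant 2: an assumed labelling that singles out one user as the target.
roles = {"a": "target", "b": "other", "c": "other"}
labelled = wl_subtree_features(conversation, roles, iterations=1)

print(sorted(uniform.items()))
print(sorted(labelled.items()))
```

The two variants yield different vocabularies for the same structure, which is the kind of extra information a label-aware embedding can exploit. Training the embedding itself is out of scope for this sketch.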