Synonym recognition from short texts: A self-supervised learning approach

EXPERT SYSTEMS WITH APPLICATIONS(2023)

引用 1|浏览16
暂无评分
摘要
Synonyms refer to different expressions for the same entity in the text and affect entity-centric text mining research performance. Therefore, synonym recognition has become a promising research topic in recent years. However, most existing approaches are based on structured, semi-structured, or long text, and only a few studies have tackled synonym recognition in short texts on social networks. Synonyms recognition in short texts confronts several research challenges. First, there are a large number of unlabeled synonyms in the short texts. Second, many new words will appear in short text on social networks. Therefore, in this paper, we propose a self-supervised learning method to recognize synonyms in short texts, which consists of two steps. First, we use a clustering algorithm to generate a pseudo-label for expression. Second, we input the co-occurrence information and the character information of the expressions into a deep-learning model to obtain the feature representation of the expression. The two steps are executed iteratively until the algorithm converges. To demonstrate the effectiveness of the proposed method, we conducted extensive experiments on a real short-text dataset, and the results suggest the effectiveness of our proposal.
更多
查看译文
关键词
Self-supervised,Synonyms,Short-text,Clustering
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要