Network New Word Discovery Framework Based on Sentence Semantic Vector Similarity

GanFeng Yu,Yue Feng Ma, Yang Song

2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)(2022)

引用 0|浏览2
暂无评分
摘要
New word discovery is a key problem in text information retrieval technology. Methods in new word discovery are often closely related to words. Because their target is words, the findings are obtained by designing methods to analyze words. With the popularity of social networks, individual netizens and online self-media have generated various network texts for the convenience of online life, including network new words that are far from standard Chinese expression. How detect network new words is one of the important goals in the field of new word discovery today. In this paper, we integrate the word embedding model and clustering methods to propose a network new word discovery framework based on sentence semantic similarity (S3-N2WD) to detect network new words effectively from the network texts. This framework constructs sentence semantic vectors through a distributed representation model, uses the similarity of sentence semantic vectors to determine the semantic relationship between sentences, and finally realizes new network word discovery by the meaning of semantic replacement between sentences. The experiment verifies that the framework not only completes the rapid discovery of network new words but also realizes the standard word meaning of the discovery of it, which reflects the effectiveness of our work.
更多
查看译文
关键词
New word discovery,Text information retrieval,Natural language processing,Information extraction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要