An OOV Recognition Based Approach to Detecting Sensitive Information in Dialogue Texts of Electric Power Customer Services

Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications EngineeringArtificial Intelligence for Communications and Networks(2021)

引用 0|浏览0
暂无评分
摘要
Sensitive word recognition technology is of great significance to the protection of enterprise privacy data. In electric power custom services systems, the dialogue texts recording the conversational information between electric power customers and the customer services staffs contain some sensitive information of electric power customers. However, the colloquialism and synonyms in dialogue texts often make sensitive information recognition more difficult. In this paper, we proposed an out-of-vocabulary (OOV) approach for recognizing sensitive words in the dialogue texts of electric power customer services. We combine the semantic similarity based on word embeddings and structural semantic similarity based on HowNet for recognizing sensitive OOV words in the dialogue texts. The related experiments were made, and the experimental results show that our method has higher recognition accuracy in comparison with the popular approaches.
更多
查看译文
关键词
Out-of-vocabulary, Sensitive word recognition, HowNet, Word embedding, Electric power customer services
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要