Unsupervised Name Ambiguity Resolution Using A Generative Model.
EMNLP '11: Proceedings of the First Workshop on Unsupervised Learning in NLP (2011)
Abstract
Resolving ambiguity associated with names found on the Web, Wikipedia or medical texts is a very challenging task, which has been of great interest to the research community. We propose a novel approach to disambiguating names using Latent Dirichlet Allocation, where the learned topics represent the underlying senses of the ambiguous name. We conduct a detailed evaluation on multiple data sets containing ambiguous person, location and organization names and for multiple languages such as English, Spanish, Romanian and Bulgarian. We conduct comparative studies with existing approaches and show a substantial improvement of 15 to 35% in task accuracy.
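The core idea in the abstract, treating each LDA topic as one sense of an ambiguous name, can be illustrated with a minimal sketch. It is not the authors' implementation: the collapsed Gibbs sampler, the hyperparameters (`alpha`, `beta`, `iters`), the function name `lda_senses`, the toy contexts for the name "Jordan", and the rule of assigning each context to its most probable topic are all assumptions made for illustration.

```python
import random
from collections import Counter


def lda_senses(docs, num_senses, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Fit LDA by collapsed Gibbs sampling (illustrative, not the paper's code).

    docs: list of token lists, one per context of the ambiguous name.
    Returns (theta, labels): per-context topic distributions and the
    index of the most probable topic, read here as the inferred sense.
    """
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    K = num_senses
    n_dk = [[0] * K for _ in docs]        # topic counts per context
    n_kw = [Counter() for _ in range(K)]  # word counts per topic
    n_k = [0] * K                         # total tokens per topic
    z = []                                # topic assignment per token

    # Random initialisation of topic assignments.
    for d, doc in enumerate(docs):
        assignments = []
        for w in doc:
            k = rng.randrange(K)
            assignments.append(k)
            n_dk[d][k] += 1
            n_kw[k][w] += 1
            n_k[k] += 1
        z.append(assignments)

    # Gibbs sweeps: resample each token's topic given all other tokens.
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                k = z[d][i]
                n_dk[d][k] -= 1
                n_kw[k][w] -= 1
                n_k[k] -= 1
                weights = [
                    (n_dk[d][t] + alpha)
                    * (n_kw[t][w] + beta)
                    / (n_k[t] + vocab_size * beta)
                    for t in range(K)
                ]
                k = rng.choices(range(K), weights=weights)[0]
                z[d][i] = k
                n_dk[d][k] += 1
                n_kw[k][w] += 1
                n_k[k] += 1

    # Smoothed per-context topic distribution; each row sums to 1.
    theta = [
        [(n_dk[d][t] + alpha) / (len(doc) + K * alpha) for t in range(K)]
        for d, doc in enumerate(docs)
    ]
    labels = [max(range(K), key=lambda t: row[t]) for row in theta]
    return theta, labels


# Hypothetical contexts mentioning the ambiguous name "Jordan".
contexts = [
    "jordan scored points in the basketball game for the bulls",
    "the bulls won the basketball title as jordan scored again",
    "jordan borders israel and the capital of jordan is amman",
    "amman is the capital city and the river borders israel",
]
theta, labels = lda_senses([c.split() for c in contexts], num_senses=2)
for text, sense in zip(contexts, labels):
    print(sense, text)
```

Contexts that receive the same label are grouped as referring to the same underlying entity. The number of senses is fixed in advance in this sketch; how the paper selects it is not stated in the abstract.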
Keywords
ambiguous name, ambiguous person, challenging task, multiple data, multiple language, task accuracy, Latent Dirichlet Allocation, Resolving ambiguity, comparative study, detailed evaluation, Unsupervised name ambiguity resolution, generative model