Stanford-UBC Entity Linking at TAC-KBP, Again.

TAC(2011)

引用 27|浏览52
暂无评分
摘要
This paper describes the joint Stanford-UBC knowledge base population system for the entity linking tasks. We participated in both the English and the cross-lingual tasks, using a dictionary from strings to possible Wikipedia titles, taken from our 2009 submission. This dictionary is based on frequencies of Wikipedia back-links, and it provides a strong context-independent baseline. For the English track, we improved on the results given by the dictionary by disambiguating entities using a distantly supervised classifier, trained on context extracted from Wikipedia. Since we did not use any text from the Wikipedia pages associated with the knowledge base nodes for the dictionary, we submitted that run to the no wiki text track, and the one using the distantly supervised classifier to the wiki text track. Our work focused on disambiguating among articles, allowing for very simple NIL strategies: the system returned NIL whenever selected Wikipedia articles were not present in the KB; moreover, NILs were then clustered only according to the target string. These simple approaches were sufficient for our runs to score above the median entry in each of their respective tracks for the English task; for the cross-lingual task, there was only one track, and our submissions (using the English-specific, context-independent dictionaries) fell below the median.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要