WebChild: Harvesting and Organizing Commonsense Knowledge from the Web

WSDM (2014)

Abstract
This paper presents a method for automatically constructing a large commonsense knowledge base, called WebChild, from Web content. WebChild contains triples that connect nouns with adjectives via fine-grained relations such as hasShape, hasTaste, and evokesEmotion. The arguments of these assertions, nouns and adjectives, are disambiguated by mapping them onto their proper WordNet senses. Our method is based on semi-supervised Label Propagation over graphs of noisy candidate assertions. We automatically derive seeds from WordNet and by pattern matching on Web text collections. The Label Propagation algorithm provides us with domain sets and range sets for 19 different relations, and with confidence-ranked assertions between WordNet senses. Large-scale experiments demonstrate the high accuracy (more than 80 percent) and coverage (more than four million fine-grained, disambiguated assertions) of WebChild.
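
The abstract names semi-supervised Label Propagation over a graph as the core technique but does not spell out the update rule. The following is a minimal sketch of generic label propagation with clamped seeds, not the authors' implementation: the function name `propagate_labels`, the toy adjectives, the edge weights, and the choice of seeds are all illustrative assumptions.

```python
from collections import defaultdict


def propagate_labels(edges, seeds, iterations=20):
    """Generic label propagation sketch (hypothetical helper, not from the paper).

    edges: list of (node_a, node_b, weight) for an undirected weighted graph.
    seeds: dict mapping a node to its known label; seed scores stay clamped.
    Returns a dict: node -> {label: score}.
    """
    neighbors = defaultdict(dict)
    for a, b, weight in edges:
        neighbors[a][b] = weight
        neighbors[b][a] = weight

    labels = sorted(set(seeds.values()))
    # Every node starts with zero mass; seeds put all mass on their label.
    scores = {node: {label: 0.0 for label in labels} for node in neighbors}
    for node, label in seeds.items():
        scores.setdefault(node, {lab: 0.0 for lab in labels})[label] = 1.0

    for _ in range(iterations):
        updated = {}
        for node in scores:
            total = sum(neighbors[node].values())
            if node in seeds or total == 0:
                # Seeds are clamped; isolated nodes have nothing to average.
                updated[node] = dict(scores[node])
                continue
            # Weighted average of the neighbors' current label scores.
            updated[node] = {
                label: sum(w * scores[other][label]
                           for other, w in neighbors[node].items()) / total
                for label in labels
            }
        scores = updated
    return scores


# Toy graph: adjectives as nodes, made-up similarity weights as edges.
toy_edges = [("sour", "bitter", 0.9), ("round", "bitter", 0.1), ("round", "oval", 0.8)]
toy_seeds = {"sour": "hasTaste", "round": "hasShape"}
toy_scores = propagate_labels(toy_edges, toy_seeds)
best = {node: max(s, key=s.get) for node, s in toy_scores.items()}
```

In this toy graph the unlabeled node `bitter` is tied mostly to the `hasTaste` seed and `oval` only to the `hasShape` seed, so each inherits the label of its strongest seed neighborhood. The resulting scores can be read as a rough confidence ranking, loosely analogous to the confidence-ranked assertions the abstract mentions.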
Keywords
Knowledge Bases, Commonsense Knowledge, Web Mining, Label Propagation, Word Sense Disambiguation