Compact encodings for all local path information in web taxonomies with application to wordnet

SOFSEM(2006)

引用 4|浏览0
暂无评分
摘要
We consider the problem of finding a compact labelling for large, rooted web taxonomies that can be used to encode all local path information for each taxonomy element. This research is motivated by the problem of developing standards for taxonomic data, and addresses the data intensive problem of evaluating semantic similarities between taxonomic elements. Evaluating such similarities often requires the processing of large common ancestor sets between elements. We propose a new class of compact labelling schemes, designed for directed acyclic graphs, and tailored for applications to large web taxonomies. Our labelling schemes significantly reduce the complexity of evaluating similarities among taxonomy elements by enabling the gleaning of inferences from the labels alone, without searching the data structure. We provide an analysis of the label lengths for the proposed schemes based on structural properties of the taxonomy. Finally, we provide supporting empirical evidence for the quality of these schemes by evaluating the performance on the WordNet taxonomy.
更多
查看译文
关键词
labelling scheme,compact labelling scheme,large common ancestor set,large web taxonomy,rooted web taxonomy,compact encodings,wordnet taxonomy,taxonomy element,data intensive problem,local path information,data structure,compact labelling,directed acyclic graph,semantic similarity,empirical evidence
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要