k-Neighborhood decentralization: a comprehensive solution to index the UMLS for large scale knowledge discovery.
Journal of Biomedical Informatics(2012)
摘要
The Unified Medical Language System (UMLS) is the largest thesaurus in the biomedical informatics domain. Previous works have shown that knowledge constructs comprised of transitively-associated UMLS concepts are effective for discovering potentially novel biomedical hypotheses. However, the extremely large size of the UMLS becomes a major challenge for these applications. To address this problem, we designed a k-neighborhood Decentralization Labeling Scheme (kDLS) for the UMLS, and the corresponding method to effectively evaluate the kDLS indexing results. kDLS provides a comprehensive solution for indexing the UMLS for very efficient large scale knowledge discovery. We demonstrated that it is highly effective to use kDLS paths to prioritize disease-gene relations across the whole genome, with extremely high fold-enrichment values. To our knowledge, this is the first indexing scheme capable of supporting efficient large scale knowledge discovery on the UMLS as a whole. Our expectation is that kDLS will become a vital engine for retrieving information and generating hypotheses from the UMLS for future medical informatics applications.
更多查看译文
关键词
whole genome,efficient large scale knowledge,transitively-associated umls concept,large size,kdls path,k-neighborhood decentralization,comprehensive solution,large scale knowledge discovery,biomedical informatics domain,kdls indexing result,novel biomedical hypothesis,future medical informatics application,indexing scheme,medical informatics,umls,algorithms,politics,graph database,unified medical language system,knowledge discovery
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要