Ontology Based Corpus Annotation and Tools

Genome Informatics(2001)

引用 28|浏览4
暂无评分
摘要
With the explosion of results in molecular biology there is an increased need for IE to extract knowledge to support database building and to search intelligently for information in online journal collections. We aim to build information extraction systems from biology papers and their abstracts available from the MEDLINE database[1, 3]. As a part of a project on information extraction from the research papers in biology domain, we are creating an expert-tagged corpus of MEDLINE abstracts, which will be used for training and testing the information extraction systems. In this paper, we outline the features of this new corpus, its ontological basis, our annotation scheme, and statistics of its annotated objects. We also show the tagging and tag management tools.
更多
查看译文
关键词
annotated corpus,natural language processing,xml,ontology
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要