Improving the inter-corpora compatibility for protein annotations.
Journal of Bioinformatics and Computational Biology (2010)
Abstract
Although there are several corpora with protein annotation, incompatibility between the annotations in different corpora remains a problem that hinders the progress of automatic recognition of protein names in biomedical literature. Here, we report on our efforts to find a solution to the incompatibility issue, and to improve the compatibility between two representative protein-annotated corpora: the GENIA corpus and the GENETAG corpus. In a comparative study, we improve our insight into the two corpora, and a series of experimental results show that most of the incompatibility can be removed.
Keywords
protein annotations, inter-corpora