GeneConnector: Unlocking the full potential of Genbank metadata

IEEE LATIN AMERICA TRANSACTIONS(2024)

引用 0|浏览17
暂无评分
摘要
Genbank currently stands as one of the most significant global repositories of genetic information. However, despite its vast quantity and diversity of data, a considerable portion of the existing records suffer from disjointed and often lacking metadata, failing to provide the necessary context of their acquisition. In light of this, we propose GeneConnector, a tool that harnesses shared information among multiple records of the same specimen in Genbank, aiming to enhance the completeness of poorly annotated nodes across various information domains. To demonstrate the tools capabilities, we conducted a comprehensive review and aggregation of available data using the Genbank database of Genera of Phytopathogenic Fungi (GOPHY). Through our evaluation, we observed substantial gains in information by analyzing shared data among nodes connecting Genbank specimen records, resulting in impressive increments ranging from 2% to a remarkable 60%. Our approach empowers users to make precise, straightforward, and accurate assessments of the context associated to results, facilitated by two metrics that gauge the current level of data annotation and the potential information gain achievable following our evaluation.
更多
查看译文
关键词
Metadata,Databases,Genomics,Bioinformatics,Codes,Taxonomy,Organisms,Genbank,NCBI,mycology,phytopathology,GOPHY,genomics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要