Data Incompleteness May form a Hard-to-Overcome Barrier to Decoding Life's Mechanism

BIOLOGY-BASEL(2022)

引用 4|浏览1
暂无评分
摘要
Simple Summary The influence of data incompleteness on the correctness of conclusions about the structure and functions of the objects under study is widely discussed in the literature. It was noted that even a small percentage of missing data can lead to incorrect conclusions and imperfect knowledge. In particular, incompleteness can lead to critical errors in the qualitative and quantitative assessments of interactions in biological systems and a distorted understanding of the functioning mechanisms of living systems. In this brief review, we attempt to demonstrate the extent of this incompleteness in functional information about living systems using the best-studied examples. We suggest that this incompleteness may form seemingly insurmountable barriers in deciphering the mechanisms of the functioning of complex systems with unpredictable properties arising from the interaction of the system components. In this brief review, we attempt to demonstrate that the incompleteness of data, as well as the intrinsic heterogeneity of biological systems, may form very strong and possibly insurmountable barriers for researchers trying to decipher the mechanisms of the functioning of live systems. We illustrate this challenge using the two most studied organisms: E. coli, with 34.6% genes lacking experimental evidence of function, and C. elegans, with identified proteins for approximately 50% of its genes. Another striking example is an artificial unicellular entity named JCVI-syn3.0, with a minimal set of genes. A total of 31.5% of the genes of JCVI-syn3.0 cannot be ascribed a specific biological function. The human interactome mapping project identified only 5-10% of all protein interactions in humans. In addition, most of the available data are static snapshots, and it is barely possible to generate realistic models of the dynamic processes within cells. Moreover, the existing interactomes reflect the de facto interaction but not its functional result, which is an unpredictable emerging property. Perhaps the completeness of molecular data on any living organism is beyond our reach and represents an unsolvable problem in biology.
更多
查看译文
关键词
bioinformatics,big data,genome,systems biology,complexity
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要