BAMBI: Integrative biostatistical and artificial-intelligence method discover coding and non-coding RNA genes as biomarkers

Peng Zhou,Zixiu Li,Feifan Liu, EuiJin Kwon, Tien-Chan Hsieh,Shangyuan Ye,Shobha Vasudevan, JungAe Lee,Chan Zhou

biorxiv(2024)

引用 0|浏览5
暂无评分
摘要
Accurate disease diagnosis and prognosis are crucial for effective treatment management and improving patient outcomes. However, accurately detecting early signs of certain diseases or recurrence remains challenging. Existing machine-learning methods for identifying gene expression biomarkers have several limitations, including poor performance on independent test datasets, inability to directly process omics data, and difficulty in identifying noncoding RNA genes as biomarkers. Additionally, these methods may not provide sufficient biological interpretation of their results, and the panel biomarkers they identify may not be suitable for clinical application. To address these limitations, we have developed a new computational method called BAMBI, which integrates multiple machine-learning algorithms and statistical approaches to identify putative coding and noncoding genes as biomarkers for disease diagnosis and prognosis. We evaluated BAMBI ability to identify diagnostic and prognostic biomarkers by analyzing multiple RNA-seq datasets from cancerous and non-cancerous diseases at population levels. The results from BAMBI demonstrate significant biological interpretability and state-of-the-art prediction performance. When the singular gene identified by BAMBI is used as a diagnostic biomarker, it achieves a balance accuracy exceeding 95% in studies of both breast cancer and psoriasis. Additionally, the prognostic biomarkers that BAMBI identifies from RNA-seq data of Acute Myeloid Leukemia (AML) patients significantly correlate with the survival rates in an independent AML patient cohort. Additionally, BAMBI outperforms existing methods by delivering more robust results, identifying biomarkers with fewer genes, and simultaneously achieving superior prediction accuracy. We have implemented BAMBI into user-friendly software for the research community. In summary, BAMBI serves as a more reliable pipeline for identifying both coding and noncoding genes as biosignature markers, enhancing the accuracy of disease diagnosis and prognosis. BAMBI is available via https://github.com/CZhouLab/BAMBI. ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要