Identification of informative genes and sub-pathways using Improved Differential Expression Analysis for Pathways (iDEAP) for cancer classification

Nurul Athirah Nasarudin,Mohd Saberi Mohamad,Zalmiyah Zakaria,Richard Sinnott, Fatma Al Jasmi, Noura Al Dhaheri

F1000Research(2023)

引用 0|浏览0
暂无评分
摘要
Background: Pathway-based analysis primarily focuses on sub-pathway-based analysis, which aids in understanding biological reactions. Several studies have found abnormalities in pathways caused by certain regions based on the etiology of diseases. The Differential Expression Analysis for Pathways (DEAP) method is one such sub-pathway-based analysis method, that identifies a local region perturbed by complex diseases based on larger pathway data. However, the method has low performance in identifying informative pathways and sub-pathways. Methods: In this paper we propose an improved DEAP (iDEAP) method for enhanced identification of perturbed sub-pathways that achieves higher performance in the detection of differentially expressed pathways. Firstly, a search algorithm adapted from the Detect Module from Seed Protein (DMSP) algorithm was implemented as part of the DEAP method to search for informative sub-pathways. Secondly, the relation among sub-pathways was taken into account by averaging the maximum absolute value for the DEAP score for the reaction among sub-pathways to support the efficient identification of informative pathways. Three gene expression data sets were applied to this research. Results: The proposed improved method shows a better performance over the previous methods. In fact, when the identified genes from the results were assessed using 10-fold cross validation to classify cancer based on accuracy, the improved method shows higher accuracy for colorectal cancer (90%) and breast cancer (94%). Conclusions: This shows that the proposed method effectively identifies informative genes related to the targeted phenotype. A biological validation was also conducted on the top five significant pathways and selected genes based on biological literature. The results from this analysis will be useful especially in the medical field for disease detection. In 10 years and beyond, computational biology will become ever more entwined with biomedical research and medicine.
更多
查看译文
关键词
improved differential expression analysis,informative genes,expression analysis,sub-pathways
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要