A Robust Ensemble Regression Model for Reconstructing Genetic Networks.

IJCNN(2023)

引用 0|浏览5
暂无评分
摘要
Genetic networks contain important information about biological processes, including regulatory relationships and gene-gene interactions. Numerous methods, using high-dimensional gene expression data have been developed to capture these interactions. These gene expression data, generated using high-throughput technologies, are prone to noise. However, most existing network inference methods are unable to cope with noisy data, making genetic network reconstruction challenging. In this paper, we propose a novel ensemble regression model combining quantile regression and cross-validated Ridge regression, RidgeCV, to infer interactions from noisy gene expression data. The application of quantile regression to GRN inference is novel, and its design makes it appropriate for noisy data. RidgeCV also addresses other important issues, such as data overfitting and multicollinearity. First, each regression method is independently applied to gene expression data and the output of these methods, in the form of ranked gene lists, is aggregated using a novel gene score-based method by considering the gene rank and model importance. The model importance score is evaluated based on an adjusted coefficient of determination. This method implicitly includes majority voting by averaging each gene score value across all models. The proposed model was tested on the DREAM4 datasets and publicly available small-scale real-world network datasets. Experiments with noisy datasets showed that the proposed ensemble model is more accurate and efficient than other state-of-the-art methods.
更多
查看译文
关键词
Gene Regulatory Networks,ensemble model,quantile regression,cross-validated Ridge,noisy gene expression data,adjusted coefficient of determination
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要