Exploiting deep transfer learning for the prediction of functional noncoding variants using genomic sequence

bioRxiv (Cold Spring Harbor Laboratory)（2022）

引用 0|浏览0

暂无评分

摘要

MotivationThough genome-wide association studies have identified tens of thousands of variants associated with complex traits and most of them fall within the noncoding regions, they may not the causal ones. The development of high-throughput functional assays leads to the discovery of experimental validated noncoding functional variants. However, these validated variants are rare due to technical difficulty and financial cost. The small sample size of validated variants makes it less reliable to develop a supervised machine learning model for achieving a whole genome-wide prediction of noncoding causal variants.ResultsWe will exploit a deep transfer learning model, which is based on convolutional neural network, to improve the prediction for functional noncoding variants. To address the challenge of small sample size, the transfer learning model leverages both large-scale generic functional noncoding variants to improve the learning of low-level features and context-specific functional noncoding variants to learn high-level features toward the contextspecific prediction task. By evaluating the deep transfer learning model on three MPRA datasets and 16 GWAS datasets, we demonstrate that the proposed model outperforms deep learning models without pretraining or retraining. In addition, the deep transfer learning model outperforms 18 existing computational methods in both MPRA and GWAS datasets.Availabilityhttps://github.com/lichen-lab/TLVarSupplementary InformationSupplementary data are available at Bioinformatics online.Contactchen61@iu.edu

查看译文

关键词

functional noncoding variants,transfer learning,deep transfer,genomic sequence

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要