Genome-wide association study-based prediction of atrial fibrillation using artificial intelligence

OPEN HEART(2022)

引用 7|浏览5
暂无评分
摘要
Objective We previously reported early-onset atrial fibrillation (AF) associated genetic loci among a Korean population. We explored whether the AF-associated single-nudeotide polymorphisms (SNPs) selected from the Genome-Wide Association Study (GWAS) of an external large cohort has a prediction power for AF in Korean population through a convolutional neural network (CNN). Methods This study included 6358 subjects (872 cases, 5486 controls) from the Korean population GWAS data. We extracted the lists of SNPs at each p value threshold of the association statistics from three different previously reported ethnical-specific GWASs. The Korean GWAS data were divided into training (64%), validation (16%) and test (20%) sets, and a stratified K-fold cross-validation was performed and repeated five times after data shuffling. Results The CNN-GWAS predictive power for AF had an area under the curve (AUC) of 0.78 +/- 0.01 based on the Japanese GWAS, AUC of 0.79 +/- 0.01 based on the European GWAS, and AUC of 0.82 +/- 0.01 based on the multiethnic GWAS, respectively. Gradient-weighted dass activation mapping assigned high saliency scores for AF associated SNPs, and the PITX2 obtained the highest saliency score. The CNN-GWAS did not show AF prediction power by SNPs with non-significant p value subset (AUC 0.56 +/- 0.01) despite larger numbers of SNPs. The CNN-GWAS had no prediction power for odd-even registration numbers (AUC 0.51 +/- 0.01). Conclusions AF can be predicted by genetic information alone with moderate accuracy. The CNN-GWAS can be a robust and useful tool for detecting polygenic diseases by capturing the cumulative effects and genetic interactions of moderately associated but statistically significant SNPs.
更多
查看译文
关键词
atrial fibrillation,genetics,genome-wide association study
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要