A draft Arab pangenome reference

Mohammed Uddin,Nasna Nassir, Mohamed A. Almarri, Muhammad Kumail,Nesrin Mohamed, Bipin Balan, Sajid Hanif, Maryam Alobathani, Bassam Adnan Jamalalail, Hanan Abdelhalim ElSokary, Dasuki Kondaramage, Suhana Shiyas, Noor Kosaji, Dharana Satsangi, Madiha Abdelmotagali,Ahmad Abou Tayoun, Osama A. A. Ahmed, Doaa Mohammed Youssef,Hanan Al Suwaidi,Ammar Albanna,Stéfan du Plessis,Hamda Khansaheb,Alawi A. Alsheikh‐Ali

Research Square (Research Square)(2023)

引用 0|浏览4
暂无评分
摘要
Human pangenomes provide a comprehensive portrayal of genetic diversity of humans, yet it lacks representation of Arab populations. We constructed the Arab Pangenome Reference (APR) from 43 individuals with diverse Arab ethnicities. Nuclear and mitochondrial pangenomes were constructed utilizing 35.52X High fidelity long reads and 53.54X ultra-long reads. This yielded high-quality contiguous (average N50=106.81 Mb) de novo assemblies that used over 99% of the sequences constructing haplotype phased diploid genome assemblies with 88% exhibited larger genome length (average 3.01 gigabase) than the prevailing human reference GRCh38. We discovered 100.93 million base pairs of novel euchromatic sequences that were not present in recent human pangenomes and in the human genome references (T2T-CHM13 and GRCh38). We identified 10.68 million population-specific small variants, 108,709 structural variants, and 838 genes (13.24% recessive disease genes) duplication from the Arab pangenome. On exploring the mitochondria pangenome, we uncovered 718 bp of novel sequences. Our study provides a valuable resource for future genetic research and genomic medicine initiatives in the Arab populations and other populations with similar genetic backgrounds.
更多
查看译文
关键词
arab pangenome reference
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要