High Quality Phasing Using Linked-Read Whole Genome Sequencing of Patient Cohorts Informs Genetic Understanding of Complex Traits

biorxiv(2022)

引用 0|浏览21
暂无评分
摘要
Phasing of heterozygous alleles is critical for interpretation of cis-effects of disease-relevant variation. For population studies, phase is often inferred from external data but read-based phasing approaches that span long genomic distances would be more accurate because they enable both genotype and phase to be obtained from a single dataset. To demonstrate how read-based phasing can provide functional insights, we sequenced 477 individuals with Cystic Fibrosis (CF) using linked-read sequencing. We benchmark read-based phasing with different short- and long-read sequencing technologies, prioritize linked-read technology as the most informative and produce a benchmark phase call set from reference sample HG002 for the community. The 477 samples display an average phase block N50 of 4.39 Mb. We use these samples to construct a graph representation of CFTR haplotypes, which facilitates understanding of complex CF alleles. Fine-mapping and phasing of the chr7q35 trypsinogen locus associated with CF meconium ileus demonstrates a 20 kb deletion and a PRSS2 missense variant p.Thr8Ile (rs62473563) independently contribute to meconium ileus risk (p=0.0028, p=0.011, respectively) and are PRSS2 pancreas eQTLs (p=9.5e-7 and p=1.4e-4, respectively), explaining the mechanism by which these polymorphisms contribute to CF. Phase enables access to haplotypes that can be used for genome graph or reference panel construction, identification of cis-effects, and for understanding disease associated loci. The phase information from linked-reads provides a causal explanation for variation at a CF-relevant locus which also has implications for the genetic basis of non-CF pancreatitis to which this locus has been reported to contribute. ### Competing Interest Statement DMC received an honorarium for teaching module development for Vertex Pharmaceuticals. NM is doing contract research trials for Vertex Phaemaceuticals and Abbvie. ALS has received speaking fees for educational programs sponsored by Vertex Pharmaceuticals. BSQ has received speaker fees from Vertex Pharmaceuticals and has served as site PI for several Vertex-sponsored clinical trials. WML is a study investigator for Vertex Pharmaceuticals. ET and FR act as consultants for Vertex Pharmaceuticals. MS participated in Vertex clinical trials and received payment for education modules. SM, AC, JG, FL, BT, WWLS, JW, ZW, RVP, KK, AH, NP, JA, CW, GCM, SB, DA, EB, CB, MC, AP, MP, RVW, DH, MJS, ET, PW, LS, FR, and LJS have no conflicts of interest.
更多
查看译文
关键词
genetic understanding,patient cohorts informs,complex traits,linked-read
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要