Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes

biorxiv(2023)

引用 3|浏览24
暂无评分
摘要
Advancements in sequencing technologies and assembly methods enable the regular production of high-quality genome assemblies characterizing complex regions. However, challenges remain in efficiently interpreting variation at various scales, from smaller tandem repeats to megabase rearrangements, across many human genomes. We present a PanGenome Research Tool Kit (PGR-TK) enabling analyses of complex pangenome structural and haplotype variation at multiple scales. We apply the graph decomposition methods in PGR-TK to the class II major histocompatibility complex demonstrating the importance of the human pangenome for analyzing complicated regions. Moreover, we investigate the Y-chromosome genes, DAZ1 / DAZ2 / DAZ3 / DAZ4 , of which structural variants have been linked to male infertility, and X-chromosome genes OPN1LW and OPN1MW linked to eye disorders. We further showcase PGR-TK across 395 complex repetitive medically important genes. This highlights the power of PGR-TK to resolve complex variation in regions of the genome that were previously too complex to analyze.
更多
查看译文
关键词
Genome assembly algorithms,Software,Structural variation,Life Sciences,general,Biological Techniques,Biological Microscopy,Biomedical Engineering/Biotechnology,Bioinformatics,Proteomics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要