Exploring high-quality microbial genomes by assembling short-reads with long-range connectivity

Research Square (Research Square)(2023)

引用 0|浏览0
暂无评分
摘要
Abstract Despite long-read sequencing enables to generate complete genomes of unculturable microbes, its high cost hinders its widespread application in large cohorts. An alternative method is to assemble short-reads with long-range connectivity, which can be a cost-effective way to generate high-quality microbial genomes. We developed Pangaea to improve metagenome assembly using short-reads with physical or virtual barcodes. It adopts a deep-learning-based binning algorithm to assemble the co-barcoded reads with similar sequence contexts and abundances to improve assemblies of high- and medium-abundance microbes. Pangaea also leverages a multi-thresholding reassembly strategy to refine assembly for low-abundance microbes. We benchmarked Pangaea with linked-reads and a combination of short- and long-reads from mock communities and human gut metagenomes. Pangaea achieved significantly higher contig continuity as well as more near-complete metagenome-assembled genomes (NCMAGs) than the existing assemblers. Pangaea was also observed to generate three complete and circular NCMAGs on the human gut microbiomes.
更多
查看译文
关键词
microbial genomes,high-quality,short-reads,long-range
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要