PGP1 personal genome assembly - a hybrid assembly dataset using ONT’s PromethION and PacBio’s HiFi sequencing

biorxiv(2021)

引用 0|浏览6
暂无评分
摘要
PGP1 is the first participant of Personal Genome Project. We present the PGP1’s chromosome-scale genome assembly. It was constructed using 255 Gb ultra-long PromethION reads and 97 Gb short paired-end reads. For reducing base calling errors, we corrected PromethION reads using 72 Gb PacBio HiFi reads. 327 Gb Hi-C chromosomal mapping data were utilized to maximize the assembly’s contiguity. PGP1’s contig assembly was 3.01 Gb in length comprising of 4,234 contigs with an N50 value of 33.8 Mb. After scaffolding with Hi-C data and extensive manual curation, we obtained a chromosome-scale assembly that represents 3,880 scaffolds with an N50 value of 142 Mb. From the Merqury assessment, PGP1 assembly achieved a high QV score of Q45.45. For a gene annotation, we predicted 106,789 genes with a liftover from the Gencode 38 and an assembly of transcriptome data. ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
关键词
personal genome assembly,genome assembly,hybrid assembly dataset,sequencing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要