Accurate analysis of short read sequencing in complex genomes: A case study using QTL-seq to target blanchability in peanut (Arachis hypogaea)

biorxiv(2021)

引用 0|浏览15
暂无评分
摘要
Next Generation sequencing was a step change for molecular genetics and genomics. Illumina sequencing in particular still provides substantial value to animal and plant genomics. A simple yet powerful technique, referred to as QTL sequencing (QTL-seq) is susceptible to high levels of noise due to ambiguity of alignment of short reads in complex regions of the genome. This noise is particularly high when working with polyploid and/or outcrossing crop species, which impairs the efficacy of QTL-seq in identifying functional variation. By filtering loci based on the optimal alignment of short reads, we have developed a pipeline, named Khufu, that substantially improves the accuracy of QTL-seq analysis in complex genomes, allowing de novo variant discovery directly from bulk sequence. We first demonstrate the pipeline by identifying and validating loci contributing to blanching percentage in peanut using lines from multiple related populations. Using other published datasets in peanut, Brassica rapa, Hordeum volgare, Lactua satvia , and Felis catus , we demonstrate that Khufu produces more accurate results straight from bulk sequence. Khufu works across species, genome ploidy level, and data types. In cases where identified QTL were fine mapped, the fine mapped region corresponds to the top of the peak identified by Khufu. The accuracy of Khufu allows the analysis of population sequencing at very low coverage (<3x), greatly decreasing the amount of sequence needed to genotype even the most complex genomes. ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
关键词
complex genomes,peanut,short read,arachis hypogaea,qtl-seq
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要