GPU accelerated partial order multiple sequence alignment for long reads self-correction

2020 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)(2020)

引用 1|浏览76
暂无评分
摘要
As third generation sequencing technologies become more reliable and widely used to solve several genome-related problems, self-correction of long reads is becoming the preferred method to reduce the error rate of Pacific Biosciences and Oxford Nanopore long reads, that is now around 10-12%. Several of these self-correction methods rely on some form of Multiple Sequence Alignment (MSA) to obtain a consensus sequence for the original reads. In particular, error-correction tools such as RACON and CONSENT use Partial Order (PO) graph alignment to accomplish this task. PO graph alignment, which is computationally more expensive than optimal global pairwise alignment between two sequences, needs to be performed several times for each read during the error correction process. GPUs have proven very effective in accelerating several computeintensive tasks in different scientific fields. We harnessed the power of these architectures to accelerate the error correction process of existing self-correction tools, to improve the efficiency of this step of genome analysis.In this paper, we introduce a GPU-accelerated version of the PO alignment presented in the POA v2 software library, implemented on an NVIDIA Tesla V100 GPU. We obtain up to 6. 5x speedup compared to 64 CPU threads run on two 2.3 GHz 16-core Intel Xeon Processors E5-2698 v3. In our implementation we focused on the alignment of smaller sequences, as the CONSENT segmentation strategy based on k-mer chaining provides an optimal opportunity to exploit the parallel-processing power of GPUs. To demonstrate this, we have integrated our kernel in the CONSENT software. This accelerated version of CONSENT provides a speedup for the whole error correction step that ranges from 1. 95x to 8. 5x depending on the input reads.
更多
查看译文
关键词
PO graph alignment,optimal global pairwise alignment,error correction process,self-correction tools,GPU-accelerated version,PO alignment,NVIDIA Tesla V100 GPU,2.3 GHz 16-core Intel Xeon Processors E,error correction step,long reads self-correction,generation sequencing technologies,genome-related problems,preferred method,error rate,self-correction methods,consensus sequence,error-correction tools,GPU accelerated partial order multiple sequence alignment
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要