CUSHAW : a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform

Motivation: New high-throughput sequencing technologies have promoted the production of short reads with dramatically low unit cost. The explosive growth of short read datasets poses a challenge to the mapping of short reads to reference genomes, such as the human genome, in terms of alignment quali...

Full description

Saved in:
Bibliographic Details
Main Authors: Liu, Yongchao., Schmidt, Bertil., Maskell, Douglas Leslie.
Other Authors: School of Computer Engineering
Format: Article
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/84328
http://hdl.handle.net/10220/10773
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-84328
record_format dspace
spelling sg-ntu-dr.10356-843282020-05-28T07:17:27Z CUSHAW : a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform Liu, Yongchao. Schmidt, Bertil. Maskell, Douglas Leslie. School of Computer Engineering DRNTU::Engineering::Computer science and engineering Motivation: New high-throughput sequencing technologies have promoted the production of short reads with dramatically low unit cost. The explosive growth of short read datasets poses a challenge to the mapping of short reads to reference genomes, such as the human genome, in terms of alignment quality and execution speed. Results: We present CUSHAW, a parallelized short read aligner based on the compute unified device architecture (CUDA) parallel programming model. We exploit CUDA-compatible graphics hardware as accelerators to achieve fast speed. Our algorithm uses a quality-aware bounded search approach based on the Burrows–Wheeler transform (BWT) and the Ferragina–Manzini index to reduce the search space and achieve high alignment quality. Performance evaluation, using simulated as well as real short read datasets, reveals that our algorithm running on one or two graphics processing units achieves significant speedups in terms of execution time, while yielding comparable or even better alignment quality for paired-end alignments compared with three popular BWT-based aligners: Bowtie, BWA and SOAP2. CUSHAW also delivers competitive performance in terms of single-nucleotide polymorphism calling for an Escherichia coli test dataset. 2013-06-27T03:04:46Z 2019-12-06T15:42:49Z 2013-06-27T03:04:46Z 2019-12-06T15:42:49Z 2012 2012 Journal Article Liu, Y., Schmidt, B., & Maskell, D. L. (2012). CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform. Bioinformatics, 28(14), 1830-1837. 1460-2059 https://hdl.handle.net/10356/84328 http://hdl.handle.net/10220/10773 10.1093/bioinformatics/bts276 en Bioinformatics © 2012 The Author.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Liu, Yongchao.
Schmidt, Bertil.
Maskell, Douglas Leslie.
CUSHAW : a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform
description Motivation: New high-throughput sequencing technologies have promoted the production of short reads with dramatically low unit cost. The explosive growth of short read datasets poses a challenge to the mapping of short reads to reference genomes, such as the human genome, in terms of alignment quality and execution speed. Results: We present CUSHAW, a parallelized short read aligner based on the compute unified device architecture (CUDA) parallel programming model. We exploit CUDA-compatible graphics hardware as accelerators to achieve fast speed. Our algorithm uses a quality-aware bounded search approach based on the Burrows–Wheeler transform (BWT) and the Ferragina–Manzini index to reduce the search space and achieve high alignment quality. Performance evaluation, using simulated as well as real short read datasets, reveals that our algorithm running on one or two graphics processing units achieves significant speedups in terms of execution time, while yielding comparable or even better alignment quality for paired-end alignments compared with three popular BWT-based aligners: Bowtie, BWA and SOAP2. CUSHAW also delivers competitive performance in terms of single-nucleotide polymorphism calling for an Escherichia coli test dataset.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Liu, Yongchao.
Schmidt, Bertil.
Maskell, Douglas Leslie.
format Article
author Liu, Yongchao.
Schmidt, Bertil.
Maskell, Douglas Leslie.
author_sort Liu, Yongchao.
title CUSHAW : a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform
title_short CUSHAW : a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform
title_full CUSHAW : a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform
title_fullStr CUSHAW : a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform
title_full_unstemmed CUSHAW : a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform
title_sort cushaw : a cuda compatible short read aligner to large genomes based on the burrows-wheeler transform
publishDate 2013
url https://hdl.handle.net/10356/84328
http://hdl.handle.net/10220/10773
_version_ 1681057041134649344