Accelerating error correction in high-throughput short-read DNA sequencing data with CUDA

机译：使用CUDA加速高通量短读DNA测序数据中的错误校正

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Emerging DNA sequencing technologies open up exciting new opportunities for genome sequencing by generating read data with a massive throughput. However, produced reads are significantly shorter and more error-prone compared to the traditional Sanger shotgun sequencing method. This poses challenges for de-novo DNA fragment assembly algorithms in terms of both accuracy (to deal with short, error-prone reads) and scalability (to deal with very large input data sets). In this paper we present a scalable parallel algorithm for correcting sequencing errors in high-throughput short-read data. It is based on spectral alignment and uses the CUDA programming model. Our computational experiments on a GTX 280 GPU show runtime savings between 10 and 19 times (for different error-rates using simulated datasets as well as real Solexa/Illumina datasets).

机译：新兴的DNA测序技术通过产生大量读取数据，为基因组测序开辟了令人兴奋的新机遇。但是，与传统的Sanger shot弹枪测序方法相比，产生的读段明显更短且更容易出错。从准确性（以处理短的，容易出错的读段）和可伸缩性（以处理非常大的输入数据集）的角度来看，这对于新颖的DNA片段组装算法提出了挑战。在本文中，我们提出了一种可伸缩的并行算法，用于纠正高通量短读数据中的排序错误。它基于光谱对齐，并使用CUDA编程模型。我们在GTX 280 GPU上进行的计算实验表明，运行时间节省了10到19倍（对于使用模拟数据集和实际Solexa / Illumina数据集的不同错误率）。

著录项

来源
《IEEE International Symposium on Parallel Distributed Processing;IPDPS 2009》|2009年|1-8|共8页
会议地点 Rome(IT);Rome(IT)
作者
Haixiang Shi; Schmidt B.; Weiguo Liu; Muller-Wittig W.;
展开▼
作者单位

Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
DNA; biology computing; genomics; molecular biophysics; parallel algorithms; CUDA programming model; GTX 280 GPU; error correction; genome sequencing; high-throughput short-read DNA sequencing data; scalable parallel algorithm;

机译：DNA;生物学计算;基因组学;分子生物物理学;并行算法; CUDA编程模型; GTX 280 GPU;纠错;基因组测序;高通量短读DNA测序数据;可扩展并行算法;

相似文献

外文文献
中文文献
专利

1. A Parallel Algorithm for Error Correction in High-Throughput Short-Read Data on CUDA-Enabled Graphics Hardware [J] . Haixiang Shi Bertil Schmidt Weiguo Liu, Wolfgang Müller-Wittig Journal of computational biology . 2010,第4期

机译：启用CUDA的图形硬件上的高通量短读数据中的纠错并行算法
2. Quality-score guided error correction for short-read sequencing data using CUDA [J] . Haixiang Shi, Bertil Schmidt, Weiguo Liu, Procedia Computer Science . 2010,第1期

机译：使用CUDA对短读序列数据进行质量得分指导的纠错
3. Denoising DNA deep sequencing data-high-throughput sequencing errors and their correction [J] . Laehnemann David, Borkhardt Arndt, McHardy Alice Carolyn Briefings in bioinformatics . 2016,第1期

机译：对DNA深度测序数据进行去噪-高通量测序错误及其纠正
4. Accelerating Error Correction in High-Throughput Short-Read DNA Sequencing Data with CUDA [C] . Haixiang Shi, Bertil Schmidt, Weiguo Liu, International Symposium on Parallel Distributed Processing . 2009

机译：用CUDA加速高吞吐量短读取DNA测序数据的纠错
5. Calculation of variance and covariance of sampling errors in complex mineral processing systems and correction of these errors by using data reconciliation [D] . Mirabedini, Azar. 1996

机译：计算复杂矿物处理系统中采样误差的方差和协方差，并使用数据对账纠正这些误差
6. Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction [O] . David Laehnemann, *, Arndt Borkhardt, -1

机译：对DNA深度测序数据进行去噪-高通量测序错误及其更正
7. Accelerating Error Correction in High-Throughput Short-Read DNA Sequencing Data with CUDA [O] . Haixiang Shi, Bertil Schmidt, Weiguo Liu, 2010

机译：使用CUDA加速高通量短读DNA测序数据中的错误校正

Accelerating error correction in high-throughput short-read DNA sequencing data with CUDA

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅