首页> 外文会议>IEEE International Symposium on Parallel Distributed Processing;IPDPS 2009 >Accelerating error correction in high-throughput short-read DNA sequencing data with CUDA
【24h】

Accelerating error correction in high-throughput short-read DNA sequencing data with CUDA

机译:使用CUDA加速高通量短读DNA测序数据中的错误校正

获取原文

摘要

Emerging DNA sequencing technologies open up exciting new opportunities for genome sequencing by generating read data with a massive throughput. However, produced reads are significantly shorter and more error-prone compared to the traditional Sanger shotgun sequencing method. This poses challenges for de-novo DNA fragment assembly algorithms in terms of both accuracy (to deal with short, error-prone reads) and scalability (to deal with very large input data sets). In this paper we present a scalable parallel algorithm for correcting sequencing errors in high-throughput short-read data. It is based on spectral alignment and uses the CUDA programming model. Our computational experiments on a GTX 280 GPU show runtime savings between 10 and 19 times (for different error-rates using simulated datasets as well as real Solexa/Illumina datasets).
机译:新兴的DNA测序技术通过产生大量读取数据,为基因组测序开辟了令人兴奋的新机遇。但是,与传统的Sanger shot弹枪测序方法相比,产生的读段明显更短且更容易出错。从准确性(以处理短的,容易出错的读段)和可伸缩性(以处理非常大的输入数据集)的角度来看,这对于新颖的DNA片段组装算法提出了挑战。在本文中,我们提出了一种可伸缩的并行算法,用于纠正高通量短读数据中的排序错误。它基于光谱对齐,并使用CUDA编程模型。我们在GTX 280 GPU上进行的计算实验表明,运行时间节省了10到19倍(对于使用模拟数据集和实际Solexa / Illumina数据集的不同错误率)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号