首页> 外文期刊>Journal of computational biology >A Parallel Algorithm for Error Correction in High-Throughput Short-Read Data on CUDA-Enabled Graphics Hardware
【24h】

A Parallel Algorithm for Error Correction in High-Throughput Short-Read Data on CUDA-Enabled Graphics Hardware

机译:启用CUDA的图形硬件上的高通量短读数据中的纠错并行算法

获取原文
           

摘要

Abstract Emerging DNA sequencing technologies open up exciting new opportunities for genome sequencing by generating read data with a massive throughput. However, produced reads are significantly shorter and more error-prone compared to the traditional Sanger shotgun sequencing method. This poses challenges for de novo DNA fragment assembly algorithms in terms of both accuracy (to deal with short, error-prone reads) and scalability (to deal with very large input data sets). In this article, we present a scalable parallel algorithm for correcting sequencing errors in high-throughput short-read data so that error-free reads can be available before DNA fragment assembly, which is of high importance to many graph-based short-read assembly tools. The algorithm is based on spectral alignment and uses the Compute Unified Device Architecture (CUDA) programming model. To gain efficiency we are taking advantage of the CUDA texture memory using a space-efficient Bloom filter data structure for spectrum membership qu..." /> rel="meta" type="application/atom+xml" href="http://dx.doi.org/10.1089%2Fcmb.2009.0062" /> rel="meta" type="application/rdf+json" href="http://dx.doi.org/10.1089%2Fcmb.2009.0062" /> rel="meta" type="application/unixref+xml" href="http://dx.doi.org/10.1089%2Fcmb.2009.0062" /> 展开▼
机译:摘要新兴的DNA测序技术通过产生大量读取数据,为基因组测序带来了令人兴奋的新机遇。但是,与传统的Sanger shot弹枪测序方法相比,产生的读段明显更短且更容易出错。这对从头DNA片段组装算法提出了挑战,即准确性(以处理短的,容易出错的读取)和可伸缩性(以处理非常大的输入数据集)。在本文中,我们提出了一种可伸缩的并行算法,用于校正高通量短读数据中的测序错误,以便在DNA片段组装之前可以进行无错读取,这对于许多基于图的短读组装非常重要工具。该算法基于频谱对齐,并使用Compute Unified Device Architecture(CUDA)编程模型。为了提高效率,我们利用CUDA纹理存储器,利用空间高效的Bloom过滤器数据结构来实现频谱隶属关系分析。“ ... <元名称=“ dc.Format” content =“文本/ HTML” /> <元名称=“ dc。标识符” scheme =“ publisher-id” content =“ 10.1089 / cmb.2009.0062” /> rel =“ meta” type =“ application / atom + xml“ href =” http://dx.doi.org/10.1089%2Fcmb.2009.0062“ /> rel =” meta“ type =” application / rdf + json“ href =” http:// dx。 doi.org/10.1089%2Fcmb.2009.0062“ /> rel =” meta“ type =” application / unixref + xml“ href =” http://dx.doi.org/10.1089%2Fcmb.2009.0062“ /> < meta name =“ MSSmartTagsPreventParsing” content =“ true

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号