首页> 外文会议>Scientific and statistical database management >Integrating GPU-Accelerated Sequence Alignment and SNP Detection for Genome Resequencing Analysis
【24h】

Integrating GPU-Accelerated Sequence Alignment and SNP Detection for Genome Resequencing Analysis

机译:集成GPU加速序列比对和SNP检测以进行基因组重测序分析

获取原文
获取原文并翻译 | 示例

摘要

DNA sequence alignment and single-nucleotide polymorphism (SNP) detection are two important tasks in genomics research. A common genome resequencing analysis workflow is to first perform sequence alignment and then detect SNPs among the aligned sequences. In practice, the performance bottleneck in this workflow is usually the intermediate result I/O due to the separation of the two components, especially when the in-memory computation has been accelerated, e.g., by graphics processors. To address this bottleneck, we propose to integrate the two tasks tightly so as to eliminate the I/O of intermediate results in the workflow. Specifically, we make the following three changes for the tight integration: (1) we adopt a partition-based approach so that the external sorting of alignment results, which was required for SNP detection, is eliminated; (2) we perform customized compression on alignment results to reduce memory footprint; and (3) we move the computation of a global matrix from SNP detection to sequence alignment to save a file scan. We have developed a GPU-accelerated system that tightly integrates sequence alignment and SNP detection. Our results with human genome data sets show that our GPU-acceleration of individual components in the traditional workflow improves the overall performance by 18 times and that the tight integration further improves the performance of the GPU-accelerated system by 2.3 times.
机译:DNA序列比对和单核苷酸多态性(SNP)检测是基因组学研究中的两个重要任务。常见的基因组重测序分析工作流程是先执行序列比对,然后在比对的序列中检测SNP。实际上,由于两个组件的分离,该工作流中的性能瓶颈通常是中间结果I / O,尤其是在内存中计算已由图形处理器加速时。为了解决此瓶颈,我们建议将两个任务紧密集成在一起,以消除工作流程中中间结果的I / O。具体来说,我们针对紧密集成进行了以下三个更改:(1)我们采用基于分区的方法,从而消除了SNP检测所需的比对结果的外部排序; (2)对对齐结果执行自定义压缩,以减少内存占用; (3)将全局矩阵的计算从SNP检测转移到序列比对,以保存文件扫描。我们开发了GPU加速系统,该系统紧密集成了序列比对和SNP检测。我们对人类基因组数据集的结果表明,我们在传统工作流程中对单个组件的GPU加速将整体性能提高了18倍,而紧密的集成进一步将GPU加速系统的性能提高了2.3倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号