【24h】

FPGA-based PairHMM Forward Algorithm for DNA Variant Calling

机译:基于FPGA的PairHMM转发算法进行DNA变异调用

获取原文

摘要

One of the main objectives of human genetic research is the identification of DNA variations that may be involved in the development of rare diseases. Thanks to advances in DNA sequencing technologies and to a progressive integration of the available genetic databases, it is now possible to study not only common variants, but also ones occurring at very low frequencies in the population. Despite the presence of consolidated algorithms to perform the analysis of genetic data, the major hurdle is the impossibility to efficiently process the data, and translate them into biologically meaningful information. This prevents the current solutions and architectures from scaling to growing number of individuals, both for the discovery of new variants and the adoption of these methodologies to support diagnosis and treatment of illnesses in a common clinical setting. In this scenario, Field Programmable Gate Arrays (FPGAs) provide a viable alternative to conventional software-based approaches. In particular, they have already proved to efficiently manage huge workloads while reaching outstanding performance over power consumption scores. Therefore, the purpose of this work is exploring novel computing paradigms to tackle the limitations we are facing. In particular, we present an FPGA-based acceleration of the PairHMM Forward Algorithm, the performance bottleneck in the HaplotypeCaller, a variant calling tool in the popular Genome Analysis Toolkit (GATK). Our final architecture is able to achieve 2160x speedup when compared to the Original Java version in GATK, outperforming existing implementation on both CPUs, GPUs and FPGAs.
机译:人类遗传研究的主要目标之一是识别可能与罕见疾病发展有关的DNA变异。由于DNA测序技术的进步以及现有遗传数据库的逐步整合,现在不仅可以研究常见变异,而且还可以研究种群中极低频率的变异。尽管存在执行遗传数据分析的整合算法,但主要障碍是无法有效处理数据并将其转换为生物学上有意义的信息。这将阻止当前的解决方案和体系结构扩展到越来越多的个人,既用于发现新变体,又无法采用这些方法来支持在常见临床环境中诊断和治疗疾病。在这种情况下,现场可编程门阵列(FPGA)为传统的基于软件的方法提供了可行的替代方案。特别是,它们已经被证明可以有效地管理巨大的工作负载,同时在功耗分数上达到出色的性能。因此,这项工作的目的是探索新颖的计算范式,以解决我们面临的局限性。特别是,我们介绍了PairHMM转发算法的基于FPGA的加速,这是HaplotypeCaller中的性能瓶颈,HaplotypeCaller是流行的Genome Analysis Toolkit(GATK)中的变体调用工具。与GATK中的原始Java版本相比,我们的最终架构能够实现2160倍的加速,胜过CPU,GPU和FPGA上的现有实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号