首页> 外文学位 >FPGA acceleration of sequence analysis tools in bioinformatics.
【24h】

FPGA acceleration of sequence analysis tools in bioinformatics.

机译:FPGA加速了生物信息学中的序列分析工具。

获取原文
获取原文并翻译 | 示例

摘要

With advances in biotechnology and computing power, biological data are being produced at an exceptional rate. The purpose of this study is to analyze the application of FPGAs to accelerate high impact production biosequence analysis tools. Compared with other alternatives, FPGAs offer huge compute power, lower power consumption, and reasonable flexibility.;BLAST has become the de facto standard in bioinformatic approximate string matching and so its acceleration is of fundamental importance. It is a complex highly-optimized system, consisting of tens of thousands of lines of code and a large number of heuristics. Our idea is to emulate the main phases of its algorithm on FPGA. Utilizing our FPGA engine, we quickly reduce the size of the database to a small fraction, and then use the original code to process the query. Using a standard FPGA-based system, we achieved 12x speedup over a highly optimized multithread reference code.;Multiple Sequence Alignment (MSA)---the extension of pairwise Sequence Alignment to multiple Sequences---is critical to solve many biological problems. Previous attempts to accelerate Clustal-W, the most commonly used MSA code, have directly mapped a portion of the code to the FPGA. We use a new approach: we apply prefiltering of the kind commonly used in BLAST to perform the initial all-pairs alignments. This results in a speedup of from 80x to 190x over the CPU code (8 cores). The quality is comparable to the original according to a commonly used benchmark suite evaluated with respect to multiple distance metrics.;The challenge in FPGA-based acceleration is finding a suitable application mapping. Unfortunately many software heuristics do not fall into this category and so other methods must be applied. One is restructuring: an entirely new algorithm is applied. Another is to analyze application utilization and develop accuracy/performance tradeoffs. Using our prefiltering approach and novel FPGA programming models we have achieved significant speedup over reference programs. We have applied approximation, seeding, and filtering to this end. The bulk of this study is to introduce the pros and cons of these acceleration models for biosequence analysis tools.
机译:随着生物技术和计算能力的发展,生物数据正以惊人的速度产生。这项研究的目的是分析FPGA的应用,以加速产生高影响力的生产生物序列分析工具。与其他替代产品相比,FPGA具有巨大的计算能力,更低的功耗和合理的灵活性。BLAST已成为生物信息近似字符串匹配中的事实上的标准,因此其加速至关重要。它是一个复杂的高度优化的系统,由成千上万的代码行和大量的启发式方法组成。我们的想法是在FPGA上模拟其算法的主要阶段。利用我们的FPGA引擎,我们迅速将数据库的大小减小了一小部分,然后使用原始代码来处理查询。使用基于FPGA的标准系统,我们在高度优化的多线程参考代码上实现了12倍的加速。多序列比对(MSA)-将逐对序列比对扩展到多个序列-对于解决许多生物学问题至关重要。先前加速Clustal-W(最常用的MSA代码)的尝试已将部分代码直接映射到FPGA。我们使用一种新的方法:我们应用BLAST中常用的那种预过滤来执行初始所有对比对。这样可使CPU代码(8个内核)的速度从80倍提高到190倍。根据针对多个距离指标进行评估的常用基准套件,其质量可与原始质量相媲美。基于FPGA的加速所面临的挑战是找到合适的应用程序映射。不幸的是,许多软件启发式方法不属于此类,因此必须采用其他方法。一种是重组:应用了一种全新的算法。另一个是分析应用程序利用率并开发准确性/性能折衷。使用我们的预过滤方法和新颖的FPGA编程模型,我们已经大大提高了参考程序的速度。为此,我们已经应用了近似,播种和过滤。这项研究的主要内容是介绍这些加速模型在生物序列分析工具中的利弊。

著录项

  • 作者

    Mahram, Atabak.;

  • 作者单位

    Boston University.;

  • 授予单位 Boston University.;
  • 学科 Engineering Computer.;Biology Bioinformatics.
  • 学位 Ph.D.
  • 年度 2013
  • 页码 187 p.
  • 总页数 187
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号