FPGA acceleration of sequence analysis tools in bioinformatics.

机译：FPGA加速了生物信息学中的序列分析工具。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

With advances in biotechnology and computing power, biological data are being produced at an exceptional rate. The purpose of this study is to analyze the application of FPGAs to accelerate high impact production biosequence analysis tools. Compared with other alternatives, FPGAs offer huge compute power, lower power consumption, and reasonable flexibility.;BLAST has become the de facto standard in bioinformatic approximate string matching and so its acceleration is of fundamental importance. It is a complex highly-optimized system, consisting of tens of thousands of lines of code and a large number of heuristics. Our idea is to emulate the main phases of its algorithm on FPGA. Utilizing our FPGA engine, we quickly reduce the size of the database to a small fraction, and then use the original code to process the query. Using a standard FPGA-based system, we achieved 12x speedup over a highly optimized multithread reference code.;Multiple Sequence Alignment (MSA)---the extension of pairwise Sequence Alignment to multiple Sequences---is critical to solve many biological problems. Previous attempts to accelerate Clustal-W, the most commonly used MSA code, have directly mapped a portion of the code to the FPGA. We use a new approach: we apply prefiltering of the kind commonly used in BLAST to perform the initial all-pairs alignments. This results in a speedup of from 80x to 190x over the CPU code (8 cores). The quality is comparable to the original according to a commonly used benchmark suite evaluated with respect to multiple distance metrics.;The challenge in FPGA-based acceleration is finding a suitable application mapping. Unfortunately many software heuristics do not fall into this category and so other methods must be applied. One is restructuring: an entirely new algorithm is applied. Another is to analyze application utilization and develop accuracy/performance tradeoffs. Using our prefiltering approach and novel FPGA programming models we have achieved significant speedup over reference programs. We have applied approximation, seeding, and filtering to this end. The bulk of this study is to introduce the pros and cons of these acceleration models for biosequence analysis tools.

机译：随着生物技术和计算能力的发展，生物数据正以惊人的速度产生。这项研究的目的是分析FPGA的应用，以加速产生高影响力的生产生物序列分析工具。与其他替代产品相比，FPGA具有巨大的计算能力，更低的功耗和合理的灵活性。BLAST已成为生物信息近似字符串匹配中的事实上的标准，因此其加速至关重要。它是一个复杂的高度优化的系统，由成千上万的代码行和大量的启发式方法组成。我们的想法是在FPGA上模拟其算法的主要阶段。利用我们的FPGA引擎，我们迅速将数据库的大小减小了一小部分，然后使用原始代码来处理查询。使用基于FPGA的标准系统，我们在高度优化的多线程参考代码上实现了12倍的加速。多序列比对（MSA）-将逐对序列比对扩展到多个序列-对于解决许多生物学问题至关重要。先前加速Clustal-W（最常用的MSA代码）的尝试已将部分代码直接映射到FPGA。我们使用一种新的方法：我们应用BLAST中常用的那种预过滤来执行初始所有对比对。这样可使CPU代码（8个内核）的速度从80倍提高到190倍。根据针对多个距离指标进行评估的常用基准套件，其质量可与原始质量相媲美。基于FPGA的加速所面临的挑战是找到合适的应用程序映射。不幸的是，许多软件启发式方法不属于此类，因此必须采用其他方法。一种是重组：应用了一种全新的算法。另一个是分析应用程序利用率并开发准确性/性能折衷。使用我们的预过滤方法和新颖的FPGA编程模型，我们已经大大提高了参考程序的速度。为此，我们已经应用了近似，播种和过滤。这项研究的主要内容是介绍这些加速模型在生物序列分析工具中的利弊。

著录项

作者
Mahram, Atabak.;
展开▼
作者单位

Boston University.;

展开▼
授予单位 Boston University.;
学科 Engineering Computer.;Biology Bioinformatics.
学位 Ph.D.
年度 2013
页码 187 p.
总页数 187
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Efficient and Versatile FPGA Acceleration of Support Counting for Stream Mining of Sequences and Frequent Itemsets [J] . Prost-Boucle Adrien, Petrot Frederic, Leroy Vincent, ACM transactions on reconfigurable technology and systems . 2017,第3期

机译：高效，多功能的FPGA加速支持计数，用于序列和频繁项集的流挖掘
2. Rice bioinformatics. Analysis of rice sequence data and leveraging the data to other plant species [J] . Yuan QP., Sultana R., Pertea M., Plant physiology . 2001,第3期

机译：水稻生物信息学。分析水稻序列数据并将其用于其他植物物种
3. DATE 2011 Workshop Design Methods and Tools for FPGA-Based Acceleration of Scientific Computing [J] . Computing and informatics . 2011,第1期

机译：DATE 2011研讨会基于FPGA的科学计算加速的设计方法和工具
4. FPGA acceleration of the pair-HMMs forward algorithm for DNA sequence analysis [C] . Ren Shanshan, Sima Vlad-Mihai, Al-Ars Zaid IEEE International Conference on Bioinformatics and Biomedicine . 2015

机译：对HMM的FPGA正向算法的DNA序列分析加速
5. VHDL Auto-Generation Tool for Optimized Hardware Acceleration of Convolutional Neural Networks on FPGA (VGT) [D] . Hamdan, Muhammad K.A. 2018

机译：VHDL自动生成工具，用于在FPGA（VGT）上优化卷积神经网络的硬件加速
6. Rice Bioinformatics. Analysis of Rice Sequence Data and Leveraging the Data to Other Plant Species [O] . Q. Yuan, J. Quackenbush, R. Sultana, 2001

机译：水稻生物信息学。水稻序列数据分析并利用该数据用于其他植物
7. Rice Bioinformatics. Analysis of Rice Sequence Data and Leveraging the Data to Other Plant Species [O] . Yuan, Q., Quackenbush, J., Sultana, R., 2001

机译：水稻生物信息学。水稻序列数据分析并利用该数据用于其他植物

FPGA acceleration of sequence analysis tools in bioinformatics.

摘要

著录项

相似文献

相关主题

期刊订阅