Implementation and performance analysis of efficient index structures for DNA search algorithms in parallel platforms

Nuno Sebastião; Gustavo Encarnação; Nuno Roma

Abstract

Because of the large datasets usually involved in deoxyribonucleic acid (DNA) sequence alignment, the use of optimal local alignment algorithms (e.g., Smith–Waterman) is often infeasible in practical applications. As such, more efficient solutions that rely on indexed search procedures are often preferred, since they significantly reduce the time needed to obtain such alignments. Data structures commonly adopted to build such indexes include suffix trees, suffix arrays, and hash tables of q-mers.

This paper presents a comparative analysis of highly optimized parallel implementations of index-based search algorithms using these three distinct data structures, considering two different parallel platforms: a homogeneous multi-core central processing unit (CPU) and an NVIDIA Fermi graphics processing unit (GPU). In contrast to the CPU implementations, the experimental results reveal that the GPU implementations clearly favor suffix arrays, because of the performance achieved in terms of memory accesses. Furthermore, the results also reveal that both suffix trees and suffix arrays outperform hash tables of q-mers when dealing with the largest datasets.

When compared with a quad-core CPU, the results demonstrate that speedups as high as 65 can be achieved with the GPU when considering a suffix-array index, making it an adequate choice for high-performance bioinformatics applications.
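To make two of the index structures named in the abstract concrete, the following is a minimal sequential sketch of exact-match lookup with a suffix array and with a hash table of q-mers. It is an illustration only and not the authors' implementation: the function names (build_suffix_array, search_suffix_array, build_qmer_index, search_qmer_index) are hypothetical, the suffix-array construction is deliberately naive, and nothing here is parallelized or tuned for a GPU.

```python
from collections import defaultdict


def build_suffix_array(text):
    """Return the start positions of all suffixes of `text` in lexicographic order."""
    # Naive construction that compares whole suffixes; fine for short strings,
    # far too slow for genome-scale inputs (assumption: illustration only).
    return sorted(range(len(text)), key=lambda i: text[i:])


def search_suffix_array(text, sa, pattern):
    """Return the sorted positions where `pattern` occurs, using two binary searches."""
    m = len(pattern)
    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo
    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(sa[start:lo])


def build_qmer_index(text, q):
    """Map every length-q substring (q-mer) of `text` to the list of its start positions."""
    index = defaultdict(list)
    for i in range(len(text) - q + 1):
        index[text[i:i + q]].append(i)
    return index


def search_qmer_index(text, index, q, pattern):
    """Use the first q-mer of `pattern` as a seed, then verify each candidate position."""
    if len(pattern) < q:
        raise ValueError("pattern must be at least q characters long")
    return [i for i in index.get(pattern[:q], []) if text.startswith(pattern, i)]
```

For the string "GATTACAGATTACA", both lookups report the pattern "TTA" at positions 2 and 9. The suffix array answers a query with binary searches over one sorted integer array, whereas the q-mer table performs a single hash lookup followed by verification of each candidate.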

Bibliographic details

Source
Concurrency and Computation: Practice and Experience | 2015, Issue 9 | pp. 2351-2368 | 18 pages
Authors
Nuno Sebastião; Gustavo Encarnação; Nuno Roma
Author affiliations

INESC-ID/IST, Rua Alves Redol, 9, Lisboa, Portugal;

INESC-ID/IST, Rua Alves Redol, 9, Lisboa, Portugal;

INESC-ID/IST, Rua Alves Redol, 9, Lisboa, Portugal;

Original format: PDF
Text language: English
Keywords
GPGPU; indexed search; bioinformatics;

Similar literature

1. Efficient Parallel Implementation of State Estimation Algorithms on Multicore Platforms [J]. Rosen O., Medvedev A. Control Systems Technology, IEEE Transactions on. 2013, Issue 1
2. Platform impact on performance of parallel genetic algorithms: Design and implementation considerations [J]. Tabitha L. James, Reza Barkhi, John D. Johnson. Engineering Applications of Artificial Intelligence. 2006, Issue 8
3. On the parallelization and performance analysis of Barnes–Hut algorithm using Java parallel platforms [J]. Badri Munier, Muhammad Aleem, Majid Khan. SN Applied Sciences. 2020, Issue 4
4. Power efficient implementation of bit-parallel unrolled CORDIC structures for FPGA platforms [C]. Khurshid Burhan, Mir Roohie Naaz. 2015 International Conference on VLSI Systems, Architecture, Technology and Applications. 2015
5. Energy-efficient lightweight algorithms for embedded smart cameras: Design, implementation and performance analysis [D]. Casares, Mauricio. 2014
6. Highly efficient and exact method for parallelization of grid-based algorithms and its implementation in DelPhi [O]. Chuan Li, Lin Li, Jie Zhang, -1
7. Efficient implementation of computationally intensive algorithms on parallel computing platforms [O]. Nemes Csaba. 2014
