Design of high-performance parallelized gene predictors in MATLAB

Sylvain Robert Rivard; Jean-Gabriel Mailloux; Rachid Beguenane; Hung Tien Bui

首页> 外文期刊>BMC research notes >Design of high-performance parallelized gene predictors in MATLAB

【24h】

Design of high-performance parallelized gene predictors in MATLAB

机译：MATLAB中高性能并行基因预测器的设计

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Background This paper proposes a method of implementing parallel gene prediction algorithms in MATLAB. The proposed designs are based on either Goertzel’s algorithm or on FFTs and have been implemented using varying amounts of parallelism on a central processing unit (CPU) and on a graphics processing unit (GPU). Findings Results show that an implementation using a straightforward approach can require over 4.5 h to process 15 million base pairs (bps) whereas a properly designed one could perform the same task in less than five minutes. In the best case, a GPU implementation can yield these results in 57 s. Conclusions The present work shows how parallelism can be used in MATLAB for gene prediction in very large DNA sequences to produce results that are over 270 times faster than a conventional approach. This is significant as MATLAB is typically overlooked due to its apparent slow processing time even though it offers a convenient environment for bioinformatics. From a practical standpoint, this work proposes two strategies for accelerating genome data processing which rely on different parallelization mechanisms. Using a CPU, the work shows that direct access to the MEX function increases execution speed and that the PARFOR construct should be used in order to take full advantage of the parallelizable Goertzel implementation. When the target is a GPU, the work shows that data needs to be segmented into manageable sizes within the GFOR construct before processing in order to minimize execution time.

机译：背景技术本文提出了一种在MATLAB中实现并行基因预测算法的方法。拟议的设计基于Goertzel的算法或FFT，并已在中央处理单元（CPU）和图形处理单元（GPU）上使用变化的并行度实现。结果表明，采用直接方法的实现可能需要4.5小时以上的时间才能处理1500万个碱基对（bps），而设计正确的实现则可以在不到五分钟的时间内完成相同的任务。在最佳情况下，GPU实施可以在57秒内产生这些结果。结论本工作表明如何在MATLAB中使用并行性来预测非常大的DNA序列中的基因，以产生比常规方法快270倍的结果。这很重要，因为尽管MATLAB为生物信息学提供了便利的环境，但由于其明显的处理时间缓慢而通常被忽略。从实际的角度来看，这项工作提出了两种依靠不同并行化机制来加速基因组数据处理的策略。使用CPU，该工作表明对MEX函数的直接访问可提高执行速度，并且应使用PARFOR构造，以便充分利用可并行化的Goertzel实现。当目标是GPU时，该工作表明，在处理之前，需要在GFOR构造中将数据分割成可管理的大小，以最大程度地减少执行时间。

著录项

来源
《BMC research notes 》 |2012年第1期| 共页
作者
Sylvain Robert Rivard; Jean-Gabriel Mailloux; Rachid Beguenane; Hung Tien Bui;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类药物基础科学 ;
关键词

相似文献

外文文献
中文文献
专利

1. High-performance predictor for critical unstable generators based on scalable parallelized neural networks [J] . Youbo LIU, Yang LIU, Junyong LIU, Journal of Modern Power Systems and Clean Energy . 2016 ,第6期

机译：基于可扩展并行神经网络的关键不稳定发电机的高性能预测器
2. High-performance predictor for critical unstable generators based on scalable parallelized neural networks [J] . Youbo LIU, Yang LIU, Junyong LIU, 现代电力系统与清洁能源学报(英文) . 2016 ,第003期

机译：基于可扩展并行神经网络的关键不稳定发电机的高性能预测器
3. Optimum design of hybrid HVDC circuit breakers using a parallel genetic algorithm and a MATLAB-EMTP environment [J] . Javier A. Corea-Araujo, Juan A. Martinez-Velasco, Jesper Magnusson Generation, Transmission & Distribution, IET . 2017 ,第12期

机译：使用并行遗传算法和MATLAB-EMTP环境的混合HVDC断路器的优化设计
4. Optimization Design Of High-Performance Concrete Based On Genetic Algorithm Toolbox Of Matlab [C] . Xiansong Xie, Dongjin Yan, Yuezhai Zheng International Conference on Civil Engineering,Architecture and Building Materials . 2011

机译：基于Matlab遗传算法工具箱的高性能混凝土优化设计
5. On high-performance parallel fixed-point decimal multiplier designs. [D] . Zhu, Ming. 2013

机译：在高性能并行定点十进制乘法器设计上。
6. Design of high-performance parallelized gene predictors in MATLAB [O] . Sylvain Robert Rivard, Jean-Gabriel Mailloux, Rachid Beguenane, 2012

机译：MATLAB中高性能并行基因预测器的设计
7. Design of high-performance parallelized gene predictors in MATLAB [O] . Sylvain Rivard, Jean-Gabriel Mailloux, Rachid Beguenane, 2012

机译：MATLAB中高性能并行基因预测器的设计
8. Parallel Matlab: The Next Generation [R] . Kepner, J. , Travinin, N. 2003

机译：并行matlab：下一代

Design of high-performance parallelized gene predictors in MATLAB

摘要

著录项

相似文献

相关主题

期刊订阅