【24h】

Heterogeneous Parallelization of Aho-Corasick Algorithm

机译:AHO-Corasick算法的异构并行化

获取原文

摘要

Pattern discovery is one of the fundamental tasks in bioinformatics and pattern recognition is a powerful technique for searching sequence patterns in the biological sequence databases. The significant increase in the number of DNA and protein sequences expands the need for raising the performance of pattern matching algorithms. For this purpose, heterogeneous architectures can be a good choice due to their potential for high performance and energy efficiency. In this paper we present an efficient implementation of Aho-Corasick (AC) and PFAC (Parallel Failureless Aho-Corasick) algorithm on a heterogeneous CPU/GPU architecture. We progressively redesigned the algorithms and data structures to fit on the GPU architecture. Our results on different protein sequence data sets show 15% speedup comparing to the original implementation of the PFAC algorithm.
机译:模式发现是生物信息学的基本任务之一,模式识别是用于搜索生物序列数据库中的序列模式的强大技术。 DNA和蛋白质序列数量的显着增加扩展了提高模式匹配算法的性能的需要。为此目的,由于其高性能和能源效率的潜力,异构架构可能是一个不错的选择。在本文中,我们在异构CPU / GPU架构上呈现了AHO-Corasick(AC)和PFAC(并行无能AHO-Corasick)算法的高效实现。我们逐步重新设计了算法和数据结构以适应GPU架构。我们在不同蛋白质序列数据集上的结果显示了与PFAC算法的原始实现相比,加速了15%的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号