Bit-parallel approximate pattern matching: Kepler GPU versus Xeon Phi

Tuan Tu Tran; Liu Yongchao; Schmidt Bertil

Abstract

Approximate pattern matching (APM) aims to find the occurrences of a pattern inside a subject text while allowing a limited number of errors. It is widely used in application areas such as bioinformatics and information retrieval. Bit-parallel APM takes advantage of the intrinsic parallelism of bitwise operations inside a machine word. This approach typically encodes, in the form of bit arrays, either the states of a non-deterministic finite automaton (NFA) or the value differences between adjacent cells of a dynamic programming matrix. Wu-Manber (WM) is a well-known bit-parallel APM algorithm that simulates an NFA and gains parallel efficiency by performing multiple state updates within a machine word. An important parameter is the machine word size (e.g. 32 or 64 bits for CPUs). Due to increasing vector capabilities, the efficient mapping of bit-parallel APM algorithms onto modern high-performance computing architectures is an interesting research topic. Prominent examples are Xeon Phi coprocessors and CUDA-enabled GPUs, which provide words of size 512 bits (by means of vector registers) and 1024 bits (by means of warps), respectively. In this paper, we investigate mappings of the WM algorithm onto these two accelerator types. Both architectures achieve speedups of around two orders of magnitude compared to a single-threaded CPU implementation. Moreover, our tile-based implementation on a GeForce Titan graphics card runs up to 2.9 times faster than our implementation on an Intel Xeon Phi 5110P. Source code is available at http://xbitpar.sourceforge.net. © 2015 Elsevier B.V. All rights reserved.
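
To make the NFA simulation described in the abstract concrete, the following is a minimal sketch of a Wu-Manber style search in Python. It is not the authors' implementation: the function name `wu_manber_search`, its signature, and the choice to report 0-based end positions are assumptions made for illustration. Python integers are arbitrary-precision, so the sketch sidesteps the machine word size limit that motivates the mappings onto 512-bit vector registers and 1024-bit warps.

```python
def wu_manber_search(pattern, text, k):
    """Return 0-based end positions in `text` where some substring matches
    `pattern` with at most `k` edit errors (insert, delete, substitute).

    Illustrative sketch only; names and return format are assumptions.
    """
    m = len(pattern)
    if m == 0:
        raise ValueError("pattern must be non-empty")
    if k < 0:
        raise ValueError("k must be non-negative")

    full = (1 << m) - 1      # keeps only the m state bits
    accept = 1 << (m - 1)    # bit set when the whole pattern has matched

    # masks[c] has bit i set iff pattern[i] == c.
    masks = {}
    for i, ch in enumerate(pattern):
        masks[ch] = masks.get(ch, 0) | (1 << i)

    def shift(x):
        # Advance every active state by one pattern position; the OR with 1
        # models the always-active start state of the NFA.
        return ((x << 1) | 1) & full

    # rows[d] bit i: pattern[:i+1] matches a suffix of the text read so far
    # with at most d errors. Before any text, d pattern chars can be skipped.
    rows = [((1 << d) - 1) & full for d in range(k + 1)]

    hits = []
    for j, ch in enumerate(text):
        b = masks.get(ch, 0)
        old = rows
        new = [shift(old[0]) & b]            # exact-match row
        for d in range(1, k + 1):
            new.append(
                (shift(old[d]) & b)          # match the current character
                | old[d - 1]                 # extra character in the text
                | shift(old[d - 1])          # substituted character
                | shift(new[d - 1])          # skipped pattern character
            )
        rows = new
        if rows[k] & accept:
            hits.append(j)
    return hits
```

For example, `wu_manber_search("survey", "surgery", 2)` reports the end positions 4, 5 and 6. Each text character updates all pattern states of a row with a handful of bitwise operations, which is the source of the parallelism; with fixed-width words, a pattern longer than the word has to be split across several words, and that is where wider registers and warps become attractive.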


Bibliographic details

Source
Parallel Computing, 2016, issue 5, pp. 128-138 (11 pages)
Authors
Tuan Tu Tran; Liu Yongchao; Schmidt Bertil
Affiliations

Johannes Gutenberg Univ Mainz, Inst Informat, D-55128 Mainz, Germany;

Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA;

Johannes Gutenberg Univ Mainz, Inst Informat, D-55128 Mainz, Germany;

Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
Original format: PDF
Language: English
Keywords
Bit-parallel; Approximate pattern matching; Wu-Manber algorithm; CUDA; GPU; Xeon Phi

