Parallelizing Approximate Search on Adaptive Radix Trees

机译：对自适应基数树的平行化近似搜索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Efficient searching in a number of strings is a common task in computer science and radix-trees are often used as a compact storage for string saving. One variant is the Adaptive-Radix-Tree (ART). ART has adaptive node sizes for more compact and cache-friendly memory layout. Graphical Processing Units (GPUs) can be used as hardware accelerator for massively parallel tasks and in addition they use very fast memory. We propose a parallel approximate search in the ART on CPU and GPU to optimize the throughput of queries and speed up applications that depends on these algorithms. Thereby we use the edit distance to compare two search keys in the tree and select appropriate values. We use the CPU for experimental comparison with the GPU, which have several thousand cores and modern processors typically have four to several dozens cores, but theses cores and RAM are more flexible. We propose several variations of the CPU algorithm like fixed vs. dynamic memory layouts and pointer vs. pointer-less data structures. In our experimental evaluation with OpenCL on ROCm 3.0, AMDs platform for GPU-Enabled HPC and Ultrascale Computing, the speedup and throughput of the GPU implementation for the approximate search in comparison with the best CPU variant are in the maximum up to factor 4.16 depending on the size of the tree and batch size. The speedup between the best and the worst CPU algorithm is up to factor 11.67, depending on tree and batch size.

机译：在许多字符串中的高效搜索是计算机科学中的常见任务，并且基地树通常用作串节省的紧凑存储器。一个变体是自适应 - 基地树（ART）。 ART具有自适应节点大小，可用于更紧凑和缓存友好的内存布局。图形处理单元（GPU）可用作硬件加速器，用于大规模并行任务，并因此使用非常快速的内存。我们提出了在CPU和GPU上的技术方面的平行近似搜索，以优化查询的吞吐量和加速取决于这些算法的应用程序。因此，我们使用编辑距离来比较树中的两个搜索密钥，并选择适当的值。我们使用CPU与GPU进行实验比较，其中有几千个核心和现代处理器通常具有四个到几十个核心，而是核心和RAM更加灵活。我们提出了像固定与动态内存布局和指针与指针数据结构一样的CPU算法的多个变体。在我们对Roccl 3.0上的OpenCL的实验评估中，AMDS支持GPU的HPC和UltraScale Computing的平台，与最佳CPU变体相比，GPU实现的GPU实现的加速和吞吐量在最大程度上至4.16，具体取决于因素4.16树和批次大小的大小。最佳和最差CPU算法之间的加速度最多可为12.67，具体取决于树和批次大小。

著录项

来源
《Italian Symposium on Advanced Database Systems》|2020年|356p|共12页
会议地点
作者
Tobias Groth; Sven Groppe; Martin Koppehel; Thilo Pionteck;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP392-53;
关键词
Adaptive-Radix-Tree (ART); CPU acceleration; GPU acceleration; OpenCL; Edit-distance; Parallel; Approximate search;

机译：Adaptive-Radix-Tree（ART）;CPU加速;GPU加速;OpenCL;编辑距离;并行;近似搜索;

相似文献

外文文献
中文文献
专利

1. Massive parallelization of approximate nearest neighbor search on KD-tree for high-dimensional image descriptor matching [J] . Hu Linjia, Nooshabadi Saeid Journal of visual communication & image representation . 2017,第APRa期

机译：高维图像描述符匹配的KD树上近似最近邻搜索的大规模并行化
2. High-dimensional image descriptor matching using highly parallel KD-tree construction and approximate nearest neighbor search [J] . Hu Linjia, Nooshabadi Saeid Journal of Parallel and Distributed Computing . 2019,第OCTa期

机译：使用高度并行的KD树结构和近似最近邻搜索进行高维图像描述符匹配
3. High-dimensional image descriptor matching using highly parallel KD-tree construction and approximate nearest neighbor search [J] . Hu Linjia, Nooshabadi Saeid Journal of Parallel and Distributed Computing . 2019,第Octa期

机译：使用高度平行的KD树构建和近似最近邻搜索匹配的高维图像描述符
4. Parallelizing Approximate Search on Adaptive Radix Trees [C] . Tobias Groth, Sven Groppe, Martin Koppehel, Italian Symposium on Advanced Database Systems . 2020

机译：对自适应基数树的平行化近似搜索
5. The Area Code Tree for Approximate Nearest Neighbour Search in Dense Point Sets [D] . Rahman, Fatema. 2018

机译：密集点集中近似最近邻居搜索的区号树
6. Parallelization of enumerating tree-like chemical compounds by breadth-first search order [O] . Morihiro Hayashida, Jira Jindalertudomdee, Yang Zhao, 2015

机译：通过广度优先搜索顺序对树状化合物进行枚举
7. Adaptive memory programming: local search parallel algorithms for phylogenetic tree construction [O] . Jacek Blazewicz, Piotr Formanowicz, Pawel Kedziora, 2010

机译：自适应内存编程：系统发育树构建的本地搜索并行算法

Parallelizing Approximate Search on Adaptive Radix Trees

摘要

著录项

相似文献

相关主题

期刊订阅