On Longest Repeat Queries Using GPU

机译：使用GPU的最长重复查询

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Repeat finding in strings has important applications in sub-fields such as computational biology. The challenge of finding the longest repeats covering particular string positions was recently proposed and solved by Ileri et al., using a total of the optimal O(n) time and space, where n is the string size. However, their solution can only find the leftmost longest repeat for each of the n string position. It is also not known how to parallelize their solution. In this paper, we propose a new solution for longest repeat finding, which although is theoretically suboptimal in time but is conceptually simpler and works faster and uses less memory space in practice than the optimal solution. Further, our solution can find all longest repeats of every string position, while still maintaining a faster processing speed and less memory space usage. Moreover, our solution is parallelizable in the shared memory architecture (SMA), enabling it to take advantage of the modern multi-processor computing platforms such as the general-purpose graphics processing units (GPU). We have implemented both the sequential and parallel versions of our solution. Experiments with both biological and non-biological data show that our sequential and parallel solutions are faster than the optimal solution by a factor of 2-3.5 and 6-14, respectively, and use less memory space.

机译：重复在字符串中查找具有在诸如计算生物学的子字段中的重要应用。最近通过ILERI等人提出并解决了覆盖特定字符串位置的最长重复的挑战。，使用总数的O（n）时间和空间，其中n是字符串大小。但是，它们的解决方案只能为每个N个字符串位置找到最左边的重复。它还不知道如何并行化其解决方案。在本文中，我们提出了一种新的解决方案，以便最长的重复发现，尽管在理论上是次优，但在概念上更简单，工作更快，并且在实践中使用较少的内存空间而不是最佳解决方案。此外，我们的解决方案可以找到每个字符串位置的所有最长重复，同时仍保持更快的处理速度和更少的内存空间使用情况。此外，我们的解决方案在共享内存架构（SMA）中并行，使其能够利用现代多处理器计算平台，例如通用图形处理单元（GPU）。我们已经实施了我们解决方案的顺序和并行版本。具有生物和非生物数据的实验表明，我们的顺序和平行溶液分别比最佳溶液更快，分别为2-3.5和6-14倍，并使用更少的存储空间。

著录项

来源
《International conference on database systems for advanced applications》|2015年||共18页
会议地点
作者
Yun Tian; Bojian Xu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.13;
关键词
String; Repeats; Longest repeats; Parallel computing; Stream compaction; GPU; CUDA;

机译：字符串;重复;最长的重复;并行计算;流压实;GPU;CUDA;

相似文献

外文文献
中文文献
专利

1. Performance evaluation of single vs. batch of queries on GPUs [J] . Gaioso Roussian, Gil-Costa Veronica, Guardia Helio, Concurrency, practice and experience . 2020,第20期

机译：单次与GPU上的批量查询的性能评估
2. An evaluation of analytical queries on CPUs and coupled GPUs [J] . Hua Luan, Lei Chang Concurrency, practice and experience . 2017,第5期

机译：对CPU和耦合GPU的分析查询的评估
3. Speeding up Spatial Database Query Execution using GPUs [J] . Bogdan Simion, Suprio Ray, Angela Demke Brown Procedia Computer Science . 2012,第1期

机译：使用GPU加速空间数据库查询执行
4. On Longest Repeat Queries Using GPU [C] . Yun Tian, Bojian Xu International conference on database systems for advanced applications;International workshop on Semantic computing and personalization;International workshop on big data management and service . 2015

机译：使用GPU进行最长重复查询
5. Relational Joins on GPUs for In-memory Database Query Processing [D] . Rui, Ran. 2020

机译：关于内存数据库查询处理的GPU上的关系连接
6. Concurrent query processing in a GPU-based database system [O] . Hao Li, Yi-Cheng Tu, Bo Zeng 2012

机译：基于GPU的数据库系统中的并发查询处理
7. On Longest Repeat Queries Using GPU [O] . Tian, Yun, Xu, Bojian 2015

机译：使用GpU进行最长重复查询

On Longest Repeat Queries Using GPU

摘要

著录项

相似文献

相关主题

期刊订阅