...
【24h】

Fast Four-Way Parallel Radix Sorting on GPUs

机译:GPU上的快速四向并行基数排序

获取原文
获取原文并翻译 | 示例
           

摘要

Efficient sorting is a key requirement for many computer science algorithms. Acceleration of existing techniques as well as developing new sorting approaches is crucial for many real-time graphics scenarios, database systems, and numerical simulations to name just a few. It is one of the most fundamental operations to organize and filter the ever growing massive amounts of data gathered on a daily basis. While optimal sorting models for serial execution on a single processor exist, efficient parallel sorting remains a challenge. In this paper, we present a hardware-optimized parallel implementation of the radix sort algorithm that results in a significant speed up over existing sorting implementations. We outperform all known General Processing Unit (GPU) based sorting systems by about a factor of two and eliminate restrictions on the sorting key space. This makes our algorithm not only the fastest, but also the first general GPU sorting solution.
机译:高效排序是许多计算机科学算法的关键要求。加速现有技术以及开发新的排序方法对于许多实时图形场景,数据库系统和数值模拟至关重要。这是组织和筛选每天不断增长的海量数据的最基本操作之一。尽管存在用于在单个处理器上串行执行的最佳排序模型,但有效的并行排序仍然是一个挑战。在本文中,我们提出了基数排序算法的硬件优化并行实现,该算法大大提高了现有排序实现的速度。我们比所有已知的基于通用处理单元(GPU)的排序系统的性能高出大约两倍,并且消除了对排序键空间的限制。这使我们的算法不仅最快,而且是第一个通用的GPU排序解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号