Algorithms for high-throughput disk-to-disk sorting

机译：高吞吐量磁盘到磁盘排序的算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present a new out-of-core sort algorithm, designed for problems that are too large to fit into the aggregate RAM available on modern supercomputers. We analyze the performance including the cost of IO and demonstrate the fastest (to the best of our knowledge) reported throughput using the canonical sortBenchmark on a general-purpose, production HPC resource running Lustre. By clever use of available storage and a formulation of asynchronous data transfer mechanisms, we are able to almost completely hide the computation (sorting) behind the IO latency. This latency hiding enables us to achieve comparable execution times, including the additional temporary IO required, between a large sort problem (5TB) run as a single, in-RAM sort and our out-of-core approach using 1/10th the amount of RAM. In our largest run, sorting 100TB of records using 1792 hosts, we achieved an end-to-end throughput of 1.24TB/min using our general-purpose sorter, improving on the current Daytona record holder by 65%.

机译：在本文中，我们提出了一种新的核外排序算法，该算法针对的问题太大而无法放入现代超级计算机上可用的聚合RAM中。我们分析性能（包括IO成本），并在运行Lustre的通用生产HPC资源上使用规范的sortBenchmark证明（据我们所知）最快的报告吞吐量。通过巧妙地使用可用存储和制定异步数据传输机制，我们几乎可以将计算（排序）完全隐藏在IO延迟之后。这种延迟隐藏使我们能够在一个RAM排序中运行的大型排序问题（5TB）与使用1/10数量的内核外方法之间达到可比的执行时间，包括所需的额外临时IO。内存。在最大规模的运行中，使用1792台主机对100TB的记录进行了排序，使用通用分拣器实现的端到端吞吐量为1.24TB / min，与当前的Daytona记录保持器相比提高了65％。

著录项

来源
《International Conference for High Performance Computing, Networking, Storage and Analysis》|2013年|1-10|共10页
会议地点
作者
Sundar Hari; Malhotra Dhairya; Schulz Karl W.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Out-of-Core Algorithms; Parallel Algorithms; Sorting; asynchronous methods; distributed-memory parallelism; hypercube; quicksort; samplesort; shared-memory parallelism;

机译：核外算法;并行算法排序;异步方法;分布式内存并行性;超立方体快速排序样品分类共享内存并行;

相似文献

外文文献
中文文献
专利

1. Two-dimensional sorting algorithm for high-throughput K-best MIMO detection [J] . Wen Fan, Amir Alimohammad Communications, IET . 2017,第6期

机译：高通量K最佳MIMO检测的二维排序算法
2. Complexity Optimization and High-Throughput Low-Latency Hardware Implementation of a Multi-Electrode Spike-Sorting Algorithm [J] . Dragas Jelena, Jackel David, Hierlemann Andreas, Neural Systems and Rehabilitation Engineering, IEEE Transactions on . 2015,第2期

机译：多电极尖峰排序算法的复杂度优化和高通量低延迟硬件实现
3. Performance Study of Improved Heap Sort Algorithm and Other Sorting Algorithms on Different Platforms [J] . Vandana Sharma, Satwinder Singh, K. S. Kahlon International journal of computer science and network security . 2008,第4期

机译：改进的堆排序算法和其他排序算法在不同平台上的性能研究
4. Algorithms for high-throughput disk-to-disk sorting [C] . Sundar Hari, Malhotra Dhairya, Schulz Karl W. International Conference for High Performance Computing, Networking, Storage and Analysis . 2013

机译：高吞吐量磁盘对磁盘排序的算法
5. High-throughput piezoelectric-actuated micro-fluorescence-activated cell sorter (muFACS). [D] . Chen, Chun Hao (Randy). 2010

机译：高通量压电驱动的微荧光激活细胞分选仪（muFACS）。
6. Complexity Optimization and High-Throughput Low-Latency HardwareImplementation of a Multi-Electrode Spike-Sorting Algorithm [O] . Jelena Dragas, David Jäckel, Andreas Hierlemann, -1

机译：复杂度优化和高通量低延迟硬件多电极尖峰排序算法的实现
7. Efficient disk-to-disk sorting [O] . Hassan Eslami, Rajeev Thakur, Anthony Kougkas, 2015

机译：高效的磁盘到磁盘排序
8. PRIME and PDQ Sorts -Efficient Minimal Storage Sorting Algorithms. [R] . hilbrand,roy r. 1979

机译：pRImE和pDQ排序 - 高效的最小存储排序算法。

Algorithms for high-throughput disk-to-disk sorting

摘要

著录项

相似文献

相关主题

期刊订阅