Algorithms for high-throughput disk-to-disk sorting

Abstract

In this paper, we present a new out-of-core sort algorithm, designed for problems that are too large to fit into the aggregate RAM available on modern supercomputers. We analyze the performance including the cost of IO and demonstrate the fastest (to the best of our knowledge) reported throughput using the canonical sortBenchmark on a general-purpose, production HPC resource running Lustre. By clever use of available storage and a formulation of asynchronous data transfer mechanisms, we are able to almost completely hide the computation (sorting) behind the IO latency. This latency hiding enables us to achieve comparable execution times, including the additional temporary IO required, between a large sort problem (5TB) run as a single, in-RAM sort and our out-of-core approach using 1/10th the amount of RAM. In our largest run, sorting 100TB of records using 1792 hosts, we achieved an end-to-end throughput of 1.24TB/min using our general-purpose sorter, improving on the current Daytona record holder by 65%.
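
The overlap of sorting with IO described in the abstract can be illustrated on a single machine. The following is a minimal sketch, not the authors' distributed algorithm: it assumes a text file with one integer per line, and the names `external_sort`, `_read_chunks`, `chunk_lines`, and `tmp_dir` are hypothetical. A background thread reads the next chunk while the main thread sorts the current one and writes it out as a sorted run; the runs are then combined with a streaming k-way merge.

```python
import heapq
import os
import queue
import tempfile
import threading
from itertools import islice


def _read_chunks(path, chunk_lines, out_q):
    """Producer: read fixed-size chunks of lines and hand them to the sorter."""
    try:
        with open(path) as f:
            while True:
                chunk = list(islice(f, chunk_lines))
                if not chunk:
                    break
                out_q.put(chunk)  # blocks when the sorter falls behind
    finally:
        out_q.put(None)  # sentinel: no more input


def external_sort(in_path, out_path, chunk_lines=100_000, tmp_dir=None):
    """Sort a file of one-integer-per-line records using bounded memory."""
    # A small bounded queue caps how many chunks are held in RAM at once.
    q = queue.Queue(maxsize=2)
    reader = threading.Thread(target=_read_chunks, args=(in_path, chunk_lines, q))
    reader.start()

    run_paths = []
    # Phase 1: sort each chunk in memory while the reader fetches the next one.
    while (chunk := q.get()) is not None:
        keys = sorted(int(line) for line in chunk)
        fd, run_path = tempfile.mkstemp(suffix=".run", dir=tmp_dir)
        with os.fdopen(fd, "w") as run:
            run.writelines(f"{k}\n" for k in keys)
        run_paths.append(run_path)
    reader.join()

    # Phase 2: k-way merge of the sorted runs, streaming from temporary files.
    runs = [open(p) for p in run_paths]
    try:
        with open(out_path, "w") as out:
            merged = heapq.merge(*((int(line) for line in r) for r in runs))
            out.writelines(f"{k}\n" for k in merged)
    finally:
        for r in runs:
            r.close()
        for p in run_paths:
            os.remove(p)
```

Calling `external_sort("input.txt", "sorted.txt", chunk_lines=1_000_000)` would sort the file while holding only a few chunks in memory at a time. The temporary runs play the role of the "additional temporary IO" mentioned in the abstract; how the paper distributes data across hosts and Lustre is not shown here.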
