Reconciling scratch space consumption, exposure, and volatility to achieve timely staging of job input data

Abstract

Innovative scientific applications and emerging dense data sources are creating a data deluge for high-end computing systems. Processing such large input data typically involves copying (or staging) it onto the supercomputer's specialized high-speed storage, the scratch space, for sustained high I/O throughput. The current practice of conservatively staging data as early as possible makes the data vulnerable to storage failures, which may entail re-staging and consequently reduced job throughput. To address this, we present a timely staging framework that uses a combination of job start-up time predictions, user-specified intermediate nodes, and decentralized data delivery to make input data staging coincide with job start-up. By delaying staging until it is necessary, the exposure to failures and its effects can be reduced. Evaluation using both PlanetLab and simulations based on three years of Jaguar (No. 1 in Top500) job logs shows as much as an 85.9% reduction in staging times compared to direct transfers, a 75.2% reduction in wait time on scratch, and a 2.4% reduction in usage/hour.
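
The core idea of delaying staging so that it finishes close to job start-up can be illustrated with a minimal sketch. This is not the authors' framework: the function `plan_staging`, the `StagingPlan` type, the single fixed-bandwidth transfer, and the `safety_margin` parameter are all hypothetical simplifications introduced here, and the predicted start time is assumed to be given.

```python
from dataclasses import dataclass


@dataclass
class StagingPlan:
    start_time: float  # when to begin the transfer, in seconds
    exposure: float    # seconds the staged data waits on scratch before the job starts


def plan_staging(now, predicted_job_start, data_bytes,
                 bandwidth_bytes_per_s, safety_margin=0.0):
    """Pick the latest transfer start that still finishes before the predicted start.

    Hypothetical model: one transfer at a constant bandwidth.
    """
    transfer_time = data_bytes / bandwidth_bytes_per_s
    latest_start = predicted_job_start - transfer_time - safety_margin
    start = max(now, latest_start)  # a transfer cannot begin in the past
    finish = start + transfer_time
    # Exposure is the idle time on scratch; zero if the transfer finishes late.
    exposure = max(0.0, predicted_job_start - finish)
    return StagingPlan(start_time=start, exposure=exposure)


# Eager staging (start immediately) versus delayed staging for the same job.
eager_exposure = 3600 - (0 + 100e9 / 100e6)
delayed = plan_staging(now=0, predicted_job_start=3600, data_bytes=100e9,
                       bandwidth_bytes_per_s=100e6, safety_margin=300)
```

Under these assumed numbers, staging immediately would leave the data waiting on scratch far longer than the delayed plan does, which is the exposure the abstract aims to reduce. The framework described in the abstract additionally relies on intermediate nodes and decentralized delivery, which this sketch does not model.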

Bibliographic details

Source
2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS) | 2010 | pp. 1-12 | 12 pages
Conference location: Atlanta, GA (US)
Authors
Monti Henry M.; Butt Ali R.; Vazhkudai Sudharshan S.
Author affiliation

Dept. of Computer Science, Virginia Tech, Blacksburg, Virginia, USA

Conference organizer
Original format: PDF
Text language: English
Chinese Library Classification: TP311.133
Keywords
HPC center serviceability; High performance data management; data-staging; end-user data delivery;

Similar literature

1. On timely staging of HPC job input data [J] . Anoop Malaviya . Computing Reviews . 2014, No. 4
2. On Timely Staging of HPC Job Input Data [J] . Monti, Henry M., Butt, IEEE Transactions on Parallel and Distributed Systems . 2013, No. 9
3. Timely Result-Data Offloading for Improved HPC Center Scratch Provisioning and Serviceability [J] . Monti Henry M., Butt Ali R., Vazhkudai Sudharshan S. Parallel and Distributed Systems, IEEE Transactions on . 2011, No. 8
4. Reconciling scratch space consumption, exposure, and volatility to achieve timely staging of job input data [C] . Monti H.M., Butt A.R., Vazhkudai S.S. 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS) . 2010
5. Evaluation of Water consumption and savings achieved in Datacenters through Air side Economization [D] . Mishra, Ravi. 2016
6. Association between time-weighted activity space-based exposures to fast food outlets and fast food consumption among young adults in urban Canada [O] . Bochu Liu, Michael Widener, Thomas Burgoine, 2020
7. Just-in-time staging of large input data for supercomputing jobs [O] . Henry M. Monti, Ali R. Butt, Sudharshan S. Vazhkudai 2008
8. Reconciling Urban VOC/NOx (Volatile Organic Compounds/NOx) Emission Inventories with Ambient Concentration Data [R] . Ching, J. K. S., Novak, J. H., Schere, K. L., 1987
