...
首页> 外文期刊>Parallel Computing >I/O-aware bandwidth allocation systems for petascale computing
【24h】

I/O-aware bandwidth allocation systems for petascale computing

机译:用于PB级计算的I / O感知带宽分配系统

获取原文
获取原文并翻译 | 示例
           

摘要

In the Big Data era, the gap between the storage performance and an application's I/O requirement is increasing. I/O congestion caused by concurrent storage accesses from multiple applications is inevitable and severely harms the performance. Conventional approaches either focus on optimizing an application's access pattern individually or handle I/O requests on a low-level storage layer without any knowledge from the upper-level applications. In this paper, we present a novel I/O-aware bandwidth allocation framework to coordinate ongoing I/O requests on petascale computing systems. The motivation behind this innovation is that the resource management system has a holistic view of both the system state and jobs' activities and can dynamically control the jobs' status or allocate resource on the fly during their execution. We treat a job's I/O requests as periodical sub jobs within its lifecycle and transform the I/O congestion issue into a classical scheduling problem. Based on this model, we propose a bandwidth management mechanism as an extension to the existing scheduling system. We design several bandwidth allocation policies with different optimization objectives either on user-oriented metrics or system performance. We conduct extensive trace-based simulations using real job traces and I/O traces from a production IBM Blue Gene/Q system at Argonne National Laboratory. Experimental results demonstrate that our new design can improve job performance by more than 30%, as well as increasing system performance. (C) 2016 Elsevier B.V. All rights reserved.
机译:在大数据时代,存储性能与应用程序的I / O需求之间的差距越来越大。来自多个应用程序的并发存储访问不可避免地导致I / O拥塞,并严重损害性能。传统方法要么专注于单独优化应用程序的访问模式,要么在高层存储应用程序不了解的情况下,在低层存储层上处理I / O请求。在本文中,我们提出了一种新颖的I / O感知带宽分配框架,以协调Petascale计算系统上正在进行的I / O请求。这种创新背后的动机是,资源管理系统对系统状态和作业的活动具有整体的看法,并且可以动态地控制作业的状态或在执行期间动态分配资源。我们将作业的I / O请求视为其生命周期中的定期子作业,并将I / O拥塞问题转换为经典的调度问题。基于此模型,我们提出了一种带宽管理机制,作为对现有调度系统的扩展。我们针对面向用户的指标或系统性能设计了几种具有不同优化目标的带宽分配策略。我们使用来自Argonne国家实验室的生产IBM Blue Gene / Q系统的实际作业跟踪和I / O跟踪进行基于跟踪的广泛模拟。实验结果表明,我们的新设计可以将工作性能提高30%以上,并可以提高系统性能。 (C)2016 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号