Conference: International Conference for High Performance Computing, Networking, Storage and Analysis

A practical approach to reconciling availability, performance, and capacity in provisioning extreme-scale storage systems


Abstract

The increasing data demands of high-performance computing applications are rapidly raising the capacity, capability, and reliability requirements of storage systems. As systems scale, component failures and repair times increase, significantly impacting data availability. A wide array of decision points must be balanced in designing such systems. We propose a systematic approach that balances and optimizes both initial and continuous spare provisioning, based on a detailed investigation of the anatomy of extreme-scale storage systems and an analysis of their field failure data. We consider component failure characteristics together with their cost and system-level impact. We build a tool to evaluate different provisioning schemes, and the results demonstrate that our optimized provisioning can reduce the duration of data unavailability by as much as 52% under a fixed budget. We also observe that non-disk components have much higher failure rates than disks and warrant careful consideration in the overall provisioning process.
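The abstract does not describe the model or algorithm inside the evaluation tool. As a purely illustrative sketch, and not the authors' method, spare provisioning under a fixed budget can be pictured as a greedy allocation: keep buying whichever spare avoids the most expected unavailability per unit of cost. The `Component` fields, the Poisson failure assumption, and the function names below are all hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Component:
    # All fields are illustrative assumptions, not values from the paper.
    name: str
    expected_failures: float  # mean failures per provisioning period (assumed Poisson)
    unit_cost: float          # cost of one spare; must be positive
    extra_delay_hours: float  # extra unavailability when a failure finds no spare on site


def prob_more_than(k: int, mean: float) -> float:
    """P(N > k) for N ~ Poisson(mean)."""
    cdf = sum(math.exp(-mean) * mean**i / math.factorial(i) for i in range(k + 1))
    return max(0.0, 1.0 - cdf)


def marginal_gain(c: Component, spares: int) -> float:
    """Expected hours of unavailability avoided by adding one more spare.

    For a Poisson count N, E[max(N - s, 0)] - E[max(N - s - 1, 0)] = P(N > s).
    """
    return prob_more_than(spares, c.expected_failures) * c.extra_delay_hours


def greedy_provision(components, budget):
    """Buy spares one at a time, always taking the best gain per unit cost."""
    plan = {c.name: 0 for c in components}
    remaining = budget
    while True:
        affordable = [c for c in components if c.unit_cost <= remaining]
        if not affordable:
            break
        best = max(affordable, key=lambda c: marginal_gain(c, plan[c.name]) / c.unit_cost)
        if marginal_gain(best, plan[best.name]) <= 0.0:
            break  # no remaining purchase reduces expected unavailability
        plan[best.name] += 1
        remaining -= best.unit_cost
    return plan, remaining
```

Calling `greedy_provision` with one `Component` entry per part type (disks as well as non-disk parts such as controllers or power supplies) returns the number of spares to buy for each type and the unspent budget. A greedy pass is a heuristic and is not guaranteed to be optimal.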
