首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Towards Exploring Data-Intensive Scientific Applications at Extreme Scales through Systems and Simulations
【24h】

Towards Exploring Data-Intensive Scientific Applications at Extreme Scales through Systems and Simulations

机译:通过系统和仿真来探索极端规模的数据密集型科学应用

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The state-of-the-art storage architecture of high-performance computing systems was designed decades ago, and with today's scale and level of concurrency, it is showing significant limitations. Our recent work proposed a new architecture to address the I/O bottleneck of the conventional wisdom, and the system prototype (FusionFS) demonstrated its effectiveness on up to 16 K nodes—the scale on par with today's largest supercomputers. The main objective of this paper is to investigate FusionFS's scalability towards exascale. Exascale computers are predicted to emerge by 2018, comprising millions of cores and billions of threads. We built an event-driven simulator (FusionSim) according to the FusionFS architecture, and validated it with FusionFS's traces. FusionSim introduced less than 4 percent error between its simulation results and FusionFS traces. With FusionSim we simulated workloads on up to two million nodes and find out almost linear scalability of I/O performance; results justified FusionFS's viability for exascale systems. In addition to the simulation work, this paper extends the FusionFS system prototype in the following perspectives: (1) the fault tolerance of file metadata is supported, (2) the limitations of the current system design is discussed, and (3) a more thorough performance evaluation is conducted, such as N-to-1 metadata write, system efficiency, and more platforms such as Amazon Cloud.
机译:高性能计算系统的最新存储体系结构是几十年前设计的,而在当今的规模和并发水平下,它显示出了巨大的局限性。我们最近的工作提出了一种新的体系结构,以解决传统知识的I / O瓶颈,并且系统原型(FusionFS)在多达16个K节点上证明了其有效性-与当今最大的超级计算机规模相当。本文的主要目的是研究FusionFS的扩展性。 Exascale计算机预计将在2018年出现,其中包括数百万个内核和数十亿个线程。我们根据FusionFS架构构建了一个事件驱动模拟器(FusionSim),并使用FusionFS的踪迹对其进行了验证。 FusionSim在其仿真结果和FusionFS跟踪之间引入了不到4%的误差。借助FusionSim,我们可以模拟多达200万个节点上的工作负载,并发现I / O性能几乎呈线性可扩展性。结果证明了FusionFS在百亿亿次系统中的可行性。除仿真工作外,本文还从以下几个方面扩展了FusionFS系统原型:(1)支持文件元数据的容错能力;(2)讨论了当前系统设计的局限性;(3)更多信息进行了全面的性能评估,例如N对1元数据写入,系统效率以及更多平台(例如Amazon Cloud)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号