首页> 外文会议>International Reliability Physics Symposium >Exascale fault tolerance challenge and approaches
【24h】

Exascale fault tolerance challenge and approaches

机译:Exascale容错挑战和方法

获取原文

摘要

A geometrically increasing transistor count and a stagnant fault/transistor profile create a challenge in delivering a minimum acceptable user experience for the Exascale capable supercomputer. This situation propels fault-tolerant design priorities from the back ground to the foreground. Supercomputer fault tolerance must be a first class design concern for Exascale and beyond systems. Myriad solutions exist and can touch each level of the system from transistor, to circuit, micro-architecture, architecture, OS/Driver/Library, and application. Tools and methodologies to support this global effort with sufficient precision to enable trade-offs against and optimizations around power and performance are required with accuracy targeting at least 5 years into the future.
机译:几何级数增加的晶体管和停滞的故障/晶体管轮廓在为具备Exascale功能的超级计算机提供最低可接受的用户体验方面带来了挑战。这种情况将容错设计优先级从后台推到了前台。对于Exascale和其他系统,超级计算机的容错性必须是一流的设计关注点。存在无数种解决方案,它们可以涉及系统的各个级别,从晶体管到电路,微体系结构,体系结构,OS /驱动器/库和应用程序。需要以足够的精度来支持这项全球工作的工具和方法论,以在功率和性能之间进行权衡和优化,并且至少要在未来5年内实现精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号