【24h】

Perspectives on Anomaly and Event Detection in Exascale Systems

机译:百亿亿次系统中异常和事件检测的观点

获取原文

摘要

The design and implementation of exascale system is nowadays an important challenge. Such a system is expected to combine HPC with Big Data methods and technologies to allow the execution of scientific workloads which are not tractable at this present time. In this paper we focus on an event and anomaly detection framework which is crucial in giving a global overview of a exascale system (which in turn is necessary for the successful implementation and exploitation of the system). We propose an architecture for such a framework and show how it can be used to handle failures during job execution.
机译:如今,百亿亿美元系统的设计和实现是一项重要的挑战。期望这样的系统将HPC与大数据方法和技术相结合,以允许执行目前尚无法处理的科学工作负载。在本文中,我们着重于事件和异常检测框架,这对于提供亿亿级系统的全局概述至关重要(而这对于成功实施和利用该系统是必不可少的)。我们为这种框架提出了一种架构,并展示了如何在作业执行过程中使用它来处理故障。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号