首页> 外文会议>Hawaii international conference on system science;HICSS-31 >CUMULVS: Extending a generic steering and visualization middleware for application fault-tolerance
【24h】

CUMULVS: Extending a generic steering and visualization middleware for application fault-tolerance

机译:CUMULVS:扩展通用的操纵和可视化中间件以实现应用程序容错

获取原文

摘要

CUMULVS is a middleware library that provides application programmers with a simple API for describing viewable and steerable fields in large-scale distributed simulations.These descriptions provide the data type,a logical name of the field/parameter,and the mapping of global indices to local indices (processor and physical storage) for distributed data fields.The CUMULVS infrastructure uses these descriptions to allow an arbitrary number of front-end "viewer" programs to dynamically attach to a running simulation,select one or more fields for visualization,and up date steerable variables.(Viewer programs can be built using commercial visualization software such as AVS or custom software based on GUI interface builders like Tcl/Tk.) Although these data dield descriptions require a small effort on the part of the application programmer,the payoff is a high degree of flexibility for the infrastructure and end-user.This flesxibility has allowed us to extend the infrastructure to include "application-directed" checkpointing,where the application determines the essential state that must be saved for a restart.This has the advantage that checkpoints can be smaller and made portable across heterogeneous architectures using the semantic description information that can be included in the checkpoint file.Becaruse many technical difficulties,such as efficient I/O handling and time-coherency of data,are shared between visualization and checkpointing,it is advantageous to leverage a checkpoint/restart system against a visualization/steering infrastructure. Also,because CUMULVS "understands" parallel data distributions,efficient parallel checkpointing is achievable with a minimal amount of effort on the programmer's part.However,application scientists must still determine what makes up the essential state needed for an application restart and provide the proper logic for restarting from a checkpoint versus normal startup.This paper will outline the structure and communication protocols used by CUMULVS for visualization and steering.We will develop the similarities and differences between userdirected checkpointing and CUMULVS-based visualization.Finally,these concepts will be illustrated using a large synthitic seismic dataset code.
机译:CUMULVS是一个中间件库,它为应用程序程序员提供了一个简单的API,用于描述大规模分布式仿真中的可见字段和可操纵字段。这些描述提供了数据类型,字段/参数的逻辑名以及全局索引到本地的映射。 CUMULVS基础结构使用这些描述来允许任意数量的前端“查看器”程序动态地附加到正在运行的模拟中,选择一个或多个字段进行可视化,以及更新日期可操纵的变量。(可以使用诸如AVS之类的商业可视化软件或基于GUI接口构建器(如Tcl / Tk)的自定义软件来构建Viewer程序。)尽管这些数据字段描述对于应用程序程序员而言需要花费很少的精力,但回报是基础架构和最终用户具有高度的灵活性。这种灵活的功能使我们能够扩展基础架构,使其包括“ “应用程序定向”检查点,其中应用程序确定必须保存以重新启动的基本状态。这具有以下优点:检查点可以更小,并且可以使用可包含在检查点文件中的语义描述信息在异构体系结构之间移植。由于在可视化和检查点之间共享许多技术难题,例如有效的I / O处理和数据的时间一致性,因此在可视化/操纵基础结构上利用检查点/重新启动系统是有利的。另外,由于CUMULVS“了解”并行数据分布,因此只需花费最少的精力即可实现高效的并行检查点。但是,应用程序科学家仍必须确定什么构成了应用程序重新启动所需的基本状态并提供适当的逻辑本文将概述CUMULVS用于可视化和操纵的结构和通信协议。我们将开发用户控制的检查点与基于CUMULVS的可视化之间的异同。最后,将使用以下方法来说明这些概念大型合成地震数据集代码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号