Journal: Journal of Systems Architecture

Multiprogrammed non-blocking checkpoints in support of optimistic simulation on Myrinet clusters

Abstract

CCL (Checkpointing and Communication Library) is a software layer that supports optimistic parallel discrete event simulation (PDES) on Myrinet-based COTS clusters. Beyond classical low-latency message delivery, the library implements CPU-offloaded, non-blocking (asynchronous) checkpointing based on the data transfer capabilities of a programmable DMA engine on board Myrinet network cards. This functionality is unique, since optimistic simulation systems conventionally rely on checkpointing implemented as a synchronous, CPU-based data copy. Releases of CCL up to v2.4 support only monoprogrammed non-blocking checkpoints. This forces re-synchronization between CPU and DMA activities, a potential source of overhead, each time a new checkpoint request must be issued at the simulation application level while the previously issued one is still being carried out by the DMA engine. In this paper we present a redesigned release of CCL (v3.0) that, exploiting hardware capabilities of more advanced Myrinet clusters, supports multiprogrammed non-blocking checkpoints. The multiprogrammed approach allows a higher degree of concurrency between checkpointing and other simulation-specific operations carried out by the CPU, with benefits for performance. We also report the results of an experimental evaluation of those benefits for a Personal Communication System (PCS) simulation application, selected as a real-world test-bed.
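
The difference between the monoprogrammed and multiprogrammed cases can be illustrated with a toy timing model. This is only a sketch under stated assumptions, not CCL's API or the authors' implementation: the function name `cpu_stall_time`, its parameters, the constant per-checkpoint DMA cost, and the FIFO service order are all hypothetical choices made for illustration.

```python
from collections import deque


def cpu_stall_time(event_costs, dma_cost, slots):
    """Toy model: total time the CPU spends waiting for the DMA engine.

    event_costs: CPU time per simulation event; a checkpoint is requested
                 after every event (hypothetical workload).
    dma_cost:    time the DMA engine needs per checkpoint (assumed constant).
    slots:       checkpoint requests that may be outstanding at once
                 (1 models the monoprogrammed case, >1 the multiprogrammed one).
    """
    now = 0.0            # CPU clock
    dma_free_at = 0.0    # time at which the DMA engine clears its backlog
    pending = deque()    # completion times of outstanding checkpoints
    stalled = 0.0
    for cost in event_costs:
        now += cost  # the CPU processes the event
        # Retire checkpoints the DMA engine has already completed.
        while pending and pending[0] <= now:
            pending.popleft()
        # No free slot: the CPU must re-synchronize with the DMA engine.
        if len(pending) >= slots:
            done = pending.popleft()
            stalled += done - now
            now = done
        # Issue the request; the DMA engine serves requests in FIFO order.
        start = max(now, dma_free_at)
        dma_free_at = start + dma_cost
        pending.append(dma_free_at)
    return stalled
```

For a bursty workload such as `cpu_stall_time([1, 1, 5, 1, 1, 5], dma_cost=2, slots=1)`, this model yields 3.0 time units of CPU stall, while the same call with `slots=2` yields 0.0. The numbers say nothing about real CCL performance; they only show why allowing more than one outstanding request can remove re-synchronization points.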