首页> 外文会议> >Dynamic data replication: an approach to providing fault-tolerant shared memory clusters
【24h】

Dynamic data replication: an approach to providing fault-tolerant shared memory clusters

机译:动态数据复制:一种提供容错共享内存群集的方法

获取原文

摘要

A challenging issue in today's server systems is to transparently deal with failures and application-imposed requirements for continuous operation. In this paper we address this problem in shared virtual memory (SVM) clusters at the programming abstraction layer. We design extensions to an existing SVM protocol that has been tuned for low-latency, high-bandwidth interconnects and SMP nodes and we achieve reliability through dynamic replication of application shared data and protocol information. Our extensions allow us to tolerate single (or multiple, but not simultaneous) node failures. We implement our extensions on a state-of-the-art cluster and we evaluate the common, failure-free case. We find that, although the complexity of our protocol is substantially higher than its failure-free counterpart, by taking advantage of architectural features of modern systems our approach imposes low overhead and can be employed for transparently dealing with system failures.
机译:当今服务器系统中一个具有挑战性的问题是透明地处理故障和应用程序对连续操作提出的要求。在本文中,我们在编程抽象层的共享虚拟内存(SVM)群集中解决了此问题。我们设计了针对现有SVM协议的扩展,该协议已针对低延迟,高带宽互连和SMP节点进行了调整,并且我们通过动态复制应用程序共享数据和协议信息来实现可靠性。我们的扩展允许我们容忍单个(或多个,但不是同时的)节点故障。我们在最先进的集群上实施扩展,并评估常见的无故障情况。我们发现,尽管我们协议的复杂度大大高于无故障协议,但是通过利用现代系统的体系结构功能,我们的方法的开销很小,可用于透明地处理系统故障。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号