Journal: International journal of software engineering and knowledge engineering

Improving Resilience of Software Systems: A Case Study in 3D-Online Game System

Abstract

Resilience is the property that enables a system to continue operating properly when one or more faults occur. As software systems grow more complex, their hardware execution platforms also become larger and more heterogeneous. Software systems may fail because of faults such as node breakdown, communication failure, or data processing failure. In this paper, we propose a ring-based resilience mechanism that implements fault detection and recovery. (1) To relieve the heavy network traffic burden that a central server may otherwise bear, we design a ring-based heartbeat algorithm for crash fault detection. (2) We also design a recovery mechanism for crash faults that is lightweight compared with current system-specific mechanisms. To evaluate our mechanism, we use a 3D-online game system as a case study. By injecting faults, we test the effectiveness and overhead of the proposed mechanism. The experimental results show that, compared with other mechanisms, ours supports resilience well and, with acceptable overhead, is better at dealing with crash faults caused by high cluster workload.
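
The abstract does not spell out the ring-based heartbeat algorithm, so the following is only a minimal sketch of the general idea, not the paper's implementation. It assumes each node watches just its successor on a logical ring, so no central server receives every heartbeat, and that a node is suspected of crashing once its watcher has heard nothing for longer than a timeout. The class `RingMonitor`, its methods, and the timeout value are hypothetical names chosen for illustration; the `repair` step is likewise an assumed stand-in for recovery, which the abstract describes only as lightweight.

```python
from dataclasses import dataclass, field


@dataclass
class RingMonitor:
    """Toy ring-based crash detector (illustrative assumption, not the paper's code).

    Each node watches only its successor on the ring.
    """
    nodes: list                      # node names in ring order
    timeout: float = 3.0             # assumed silence threshold, in seconds
    last_seen: dict = field(default_factory=dict)

    def successor(self, node):
        """Return the node that `node` is responsible for watching."""
        i = self.nodes.index(node)
        return self.nodes[(i + 1) % len(self.nodes)]

    def heartbeat(self, node, now):
        """Record that `node` emitted a heartbeat at time `now`."""
        self.last_seen[node] = now

    def detect(self, now):
        """Return watched nodes that have been silent for longer than the timeout."""
        suspected = []
        for watcher in self.nodes:
            target = self.successor(watcher)
            if target == watcher:    # single-node ring: nothing to watch
                continue
            silent_for = now - self.last_seen.get(target, float("-inf"))
            if silent_for > self.timeout:
                suspected.append(target)
        return suspected

    def repair(self, crashed):
        """Drop crashed nodes so each watcher falls through to the next live node."""
        self.nodes = [n for n in self.nodes if n not in crashed]
```

In this sketch every node sends and checks one heartbeat per round regardless of cluster size, which is the property that keeps traffic off a central server; the trade-off is that a real system would also have to handle adjacent nodes failing together, which this toy version does not address.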
