Dynamic Fault Tolerance in Distributed Simulation System

机译：分布式仿真系统中的动态容错

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Distributed simulation system is widely used for forecasting, decision-making and scientific computing. Multi-agent and Grid have been used as platform for simulation. In order to survive from software or hardware failures and guarantee successful rate during agent migrating, system must solve the fault tolerance problem. Classic fault tolerance technology like checkpoint and redundancy can be used for distributed simulation system, but is not efficient. We present a novel fault tolerance protocol which combines the causal message logging method and prime-backup technology. The proposed protocol uses iterative backup location scheme and adaptive update interval to reduce overhead and balance the cost of fault tolerance and recovery time. The protocol has characteristics of no orphan state, and do not need the survival agents to rollback. Most important is that the recovery scheme can tolerant concurrently failures, even the permanent failure of single node. Correctness of the protocol is proved and experiments show the protocol is efficient.

机译：分布式仿真系统广泛用于预测，决策和科学计算。多代理和电网已被用作模拟平台。为了从软件或硬件故障中生存并保证代理迁移期间成功的速率，系统必须解决容错问题。经典容错技术如检查点和冗余，可用于分布式仿真系统，但不高效。我们提出了一种新颖的容错协议，它结合了因果关系记录方法和Prime-Backup技术。所提出的协议使用迭代备份位置方案和自适应更新间隔来减少开销并平衡容错和恢复时间的成本。该方案具有无孤儿状态的特点，并且不需要存活者来回滚。最重要的是，恢复方案可以容忍同时失败，即使是单个节点的永久性故障也是如此。证明了协议的正确性，实验表明协议是有效的。

著录项

来源
《International Conference on Computational Science pt.1》|2006年||共8页
会议地点
作者
Min Ma; Shiyao Jin; Chaoqun Ye; Xiaojian Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Dynamic Distributed Intrusion Detection System Based on Mobile Agents with Fault Tolerance [J] . Sasikumar R., D. Manjula Journal of computer sciences . 2012,第7期

机译：基于容错的移动Agent的动态分布式入侵检测系统
2. Dynamic Distributed Intrusion Detection System Based on Mobile Agents with Fault Tolerance | Science Publications [J] . D. Manjula, R. Sasikumar Journal of computer sciences . 2012,第7期

机译：基于具有容错能力的移动代理的动态分布式入侵检测系统科学出版物
3. A Dynamic Slack Management Technique for Real-Time Distributed Embedded System with Enhanced Fault Tolerance and Resource Constraints [J] . Santhi Baskaran, I. Gugan, A. Aswin Kumar, International Journal on Computer Science and Engineering . 2011,第1期

机译：具有增强的容错能力和资源约束的实时分布式嵌入式系统动态松弛管理技术
4. Dynamic Fault Tolerance in Distributed Simulation System [C] . Min Ma, Shiyao Jin, Chaoqun Ye, International Conference on Computational Science(ICCS 2006) pt.1; 20060528-31; Reading(GB) . 2006

机译：分布式仿真系统中的动态容错
5. Runtime systems for load balancing and fault tolerance on distributed systems. [D] . Arafat, Md Humayun. 2014

机译：运行时系统，用于分布式系统上的负载平衡和容错。
6. Ab initio molecular dynamics simulation of the effects of stacking faults on the radiation response of 3C-SiC [O] . M. Jiang, S. M. Peng, H. B. Zhang, -1

机译：从头算分子动力学模拟堆垛层错对3C-SiC辐射响应的影响
7. Dynamic Distributed Intrusion Detection System Based on Mobile Agents with Fault Tolerance [O] . D. Manjula, R. Sasikumar 2012

机译：基于容错移动代理的动态分布式入侵检测系统

Dynamic Fault Tolerance in Distributed Simulation System

摘要

著录项

相似文献

相关主题

期刊订阅