Distributed fault-tolerance for large multiprocessor systems

机译：大型多处理器系统的分布式容错

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Techniques for dealing with hardware failures in very large networks of distributed processing elements are presented. A concept known as distributed fault-tolerance is introduced. A model of a large multiprocessor system is developed and techniques, based on this model, are given by which each processing element can correctly diagnose failures in all other processing elements in the system. The effect of varying system interconnection structures upon the extent and efficiency of the diagnosis process is discussed, and illustrated with an example of an actual system.

Finally, extensions to the model, which render it more realistic, are given and a modified version of the diagnosis procedure is presented which operates under this model.

机译：提出了在大型分布式处理元件网络中处理硬件故障的技术。引入了一种称为分布式容错的概念。开发了大型多处理器系统的模型，并基于该模型给出了一些技术，通过这些技术，每个处理元件都可以正确诊断系统中所有其他处理元件的故障。讨论了各种系统互连结构对诊断过程的程度和效率的影响，并以一个实际系统为例进行了说明。

最后，给出了对该模型的扩展，使其更加逼真，并提出了在该模型下运行的诊断程序的修改版本。展开▼

著录项

来源
《Annual symposium on Computer Architecture;Symposium on Computer Architecture》|1980年|P.23-30|共8页
会议地点
作者
J. G. Kuhl; S. M. Reddy;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Fault-tolerance through scheduling of aperiodic tasks in hard real-time multiprocessor systems [J] . Ghosh S., Melhem R. IEEE Transactions on Parallel and Distributed Systems . 1997,第3期

机译：通过调度硬实时多处理器系统中的非周期性任务来实现容错
2. Fault-Tolerance In a Multiprocessor, Digital Switching System [J] . De Bimal B., Krakau Herbert B. Reliability, IEEE Transactions on . 1981,第3期

机译：多处理器数字交换系统中的容错
3. Method for Choosing a Balanced Set of Fault-Tolerance Techniques for Distributed Computer Systems [J] . D. Yu. Volkanov Automatic Control and Computer Sciences . 2017,第7期

机译：用于为分布式计算机系统选择平衡的容错技术组的方法
4. Efficient fault-tolerance for iterative graph processing on distributed dataflow systems [C] . Chen Xu, Markus Holzemer, Manohar Kaul, IEEE International Conference on Data Engineering . 2016

机译：分布式数据流系统上迭代图处理的高效容错
5. Fault-tolerance for real-time multiprocessor operating systems. [D] . Knight, George Scott. 1993

机译：实时多处理器操作系统的容错。
6. Implementing a Chaotic Cryptosystem by Performing Parallel Computing on Embedded Systems with Multiprocessors [O] . Abraham Flores-Vergara, Everardo Inzunza-González, Enrique Efren García-Guerrero, 2019

机译：通过在具有多处理器的嵌入式系统上执行并行计算来实现混沌密码系统
7. A Fault-Tolerance Model for Multiprocessor Real-Time Systems [O] . Cheng Sheng-Tzong, Chen Chia-Mei, Tripathi Satish K. 2000

机译：多处理器实时系统的容错模型
8. Fault-Tolerance in Distributed and Multiprocessor Real Time Systems [R] . Pradhan, D. K. 1995

机译：分布式多处理器实时系统的容错性

Distributed fault-tolerance for large multiprocessor systems

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅