System reliability through algorithm-based fault tolerance and reconfiguration.

Abstract

With computers being used in critical and life-impacting applications, system reliability becomes vital. Fault tolerance is a proven approach to improving the reliability of computer systems. In this thesis we study reconfiguration and Algorithm-Based Fault-Tolerance (ABFT) techniques.

ABFT techniques tolerate faults at the system level. They allow the user to decide the degree of fault tolerance needed, and achieving fault tolerance with them is cost-effective. For these principal reasons, ABFT techniques have been actively researched for application to several numerical algorithms. Typically, in the ABFT approach the input data for an algorithm are encoded so that errors can be detected or located. The number of redundant computations involved in the encoded data has to be bounded. In our research, we have improved the existing bounds on the redundant computations for many of the problem categories under the ABFT techniques.

The new bounds for error detection are derived using Latin Square (LS) arrangements. This is the first time LS has been applied to these problems. These bounds show significant improvement over the existing bounds. We have derived the bounds for both the $P$ and $P_g$ models for this family of problems.

We have also studied the bounds for error-location problems of ABFT techniques. The results presented in this thesis are the first for this category of problems. We have applied the Chinese Remainder Theorem in novel ways to derive these bounds.

Our research also includes the formulation of a new family of problems, namely error-location and error-detection. We have obtained bounds for a special case under this new category for both the $P$ and $P_g$ models.

Wafer Scale Integrated (WSI) technology holds promise for meeting future demand for computational power. The WSI Augmented Processor Array proposed earlier this decade balances the overhead of spare processors against tolerance for faults. Our contribution to this topic is the design of an efficient static reconfiguration algorithm.
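
To make the idea of encoding input data for error detection and location concrete, the sketch below uses the well-known checksum-matrix scheme for matrix multiplication from the ABFT literature. It is an illustration only and is not the encoding, the Latin Square construction, or the bounds derived in the thesis; all function names are hypothetical, and integer entries are assumed so that the checks are exact.

```python
def column_checksum(a):
    """Append one row holding the sum of each column of a."""
    return [row[:] for row in a] + [[sum(col) for col in zip(*a)]]


def row_checksum(b):
    """Append to each row of b the sum of that row."""
    return [row + [sum(row)] for row in b]


def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def locate_error(c_full):
    """Check a full-checksum product matrix.

    Returns None if every row and column checksum agrees, or (i, j) when
    exactly one data element is inconsistent. Any other pattern (for
    example a corrupted checksum entry) raises ValueError.
    """
    n, m = len(c_full) - 1, len(c_full[0]) - 1
    bad_rows = [i for i in range(n) if sum(c_full[i][:m]) != c_full[i][m]]
    bad_cols = [j for j in range(m)
                if sum(c_full[i][j] for i in range(n)) != c_full[n][j]]
    if not bad_rows and not bad_cols:
        return None
    if len(bad_rows) == 1 and len(bad_cols) == 1:
        return bad_rows[0], bad_cols[0]
    raise ValueError("error pattern is not a single data-element fault")


# Encode the inputs, multiply, then inject one fault into the result.
A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]
C_full = matmul(column_checksum(A), row_checksum(B))
clean_check = locate_error(C_full)   # no fault yet
C_full[1][0] += 5                    # simulated single fault
fault_position = locate_error(C_full)
```

The extra row and column are the redundant computations in this scheme: a single faulty element breaks exactly one row check and one column check, and their intersection locates it. Bounding how much redundancy of this kind is required is the sort of question the abstract describes.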

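Bibliographic record

  • Author

    Ramanathan, Gowri

  • Author affiliation

    Oregon State University

  • Degree-granting institution: Oregon State University
  • Discipline: Computer Science
  • Degree: Ph.D.
  • Year: 1998
  • Pages: 115 p.
  • Total pages: 115
  • Original format: PDF
  • Language: English
  • Chinese Library Classification: Automation technology, computer technology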