首页> 外文会议>2018 IEEE 23rd Pacific Rim International Symposium on Dependable Computing >Do Nothing, But Carefully: Fault Tolerance with Timing Guarantees for Multiprocessor Systems Devoid of Online Adaptation
【24h】

Do Nothing, But Carefully: Fault Tolerance with Timing Guarantees for Multiprocessor Systems Devoid of Online Adaptation

机译:无所事事,但要小心:没有在线适应性的多处理器系统具有时序保证的容错能力

获取原文
获取原文并翻译 | 示例

摘要

Many practical real-time systems must be able to sustain several reliability threats induced by their physical environments that cause short-term abnormal system behavior, such as transient faults. To cope with this change of system behavior, online adaptions, which may introduce a high computation overhead, are performed in many cases to ensure the timeliness of the more important tasks while no guarantees are provided for the less important tasks. In this work, we propose a system model which does not require any online adaption, but, according to the concept of dynamic real-time guarantees, provides full timing guarantees as well as limited timing guarantees, depending on the system behavior. For the normal system behavior, timeliness is guaranteed for all tasks; otherwise, timeliness is guaranteed only for the more important tasks while bounded tardiness is ensured for the less important tasks. Aiming to provide such dynamic timing guarantees, we propose a suitable system model and discuss, how this can be established by means of partitioned as well as semi-partitioned strategies. Moreover, we propose an approach for handling abnormal behavior with a longer duration, such as intermittent faults or overheating of processors, by performing task migration in order to compensate the affected system component and to increase the system's reliability. We show by comprehensive experiments that good acceptance ratios can be achieved under partitioned scheduling, which can be further improved under semi-partitioned strategies. In addition, we demonstrate that the proposed migration techniques lead to a reasonable trade-off between the decrease in schedulability and the gain in robustness of the system. The presented approaches can also be applied to mixed-criticality systems with two criticality levels.
机译:许多实际的实时系统必须能够承受由其物理环境引起的几种可靠性威胁,这些威胁会引起短期的异常系统行为,例如瞬态故障。为了应对这种系统行为的变化,在许多情况下会执行在线调整,这可能会导致较高的计算开销,以确保较重要的任务的及时性,而对于较不重要的任务则无法提供保证。在这项工作中,我们提出一种不需要任何在线适应的系统模型,但是根据动态实时保证的概念,根据系统行为,可以提供完整的时序保证和有限的时序保证。对于正常的系统行为,可以保证所有任务的及时性。否则,仅对于较重要的任务才保证及时性,而对于较不重要的任务则要保证有限的迟到性。为了提供这样的动态时序保证,我们提出了一个合适的系统模型,并讨论了如何通过分区和半分区策略来建立这种模型。此外,我们提出了一种通过执行任务迁移来处理较长时间的异常行为(如间歇性故障或处理器过热)的方法,以补偿受影响的系统组件并提高系统的可靠性。我们通过综合实验表明,在分区调度下可以实现良好的接受率,而在半分区策略下可以进一步提高接受率。此外,我们证明了所提出的迁移技术导致可调度性的降低与系统健壮性的提高之间的合理权衡。所提出的方法也可以应用于具有两个临界水平的混合临界系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号