...
首页> 外文期刊>International journal of software engineering and knowledge engineering >Reliability and Performance Analysis of Architecture-Based Software Implementing Restarts and Retries Subject to Correlated Component Failures
【24h】

Reliability and Performance Analysis of Architecture-Based Software Implementing Restarts and Retries Subject to Correlated Component Failures

机译:基于架构的软件的可靠性和性能分析,该重启后重试会发生相关组件故障

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

High reliability and performance are essential attributes of software systems designed for critical real-time applications. To improve the reliability and performance of software, many systems incorporate some form of fault recovery mechanism. However, contemporary models of software reliability and performance rarely consider these fault recovery mechanisms. Another notable shortcoming of many software models is that they make the simplifying assumption that component failures are statistically independent, which disagrees with several experimental studies that have shown that the failures of software components can exhibit correlation. This paper presents an architecture-based model of software reliability and performance that explicitly considers a two-stage fault recovery mechanism implementing component restarts and application-level retries. The application architecture is characterized by a Discrete Time Markov Chain (DTMC) to represent the dynamic branching behavior of control between the components of the application. Correlations between the component failures are computed with an efficient numerical algorithm for a multivariate Bernoulli (MVB) distribution. We illustrate the utility of the model through a case study of an embedded software application. The results suggest that the model can be used to quantify the impact of software fault recovery and correlated component failures on application reliability and performance.
机译:高可靠性和高性能是为关键实时应用设计的软件系统的基本属性。为了提高软件的可靠性和性能,许多系统都采用了某种形式的故障恢复机制。但是,当代的软件可靠性和性能模型很少考虑这些故障恢复机制。许多软件模型的另一个显着缺点是,它们简化了组件故障在统计上是独立的假设,这与一些实验研究不同,后者表明软件组件的故障可以表现出相关性。本文提出了一种基于体系结构的软件可靠性和性能模型,该模型明确考虑了实现组件重新启动和应用程序级重试的两阶段故障恢复机制。应用程序体系结构的特征在于离散时间马尔可夫链(DTMC),以表示应用程序组件之间控件的动态分支行为。使用多元伯努利(MVB)分布的有效数值算法来计算组件故障之间的相关性。我们通过一个嵌入式软件应用程序的案例研究来说明该模型的实用性。结果表明,该模型可用于量化软件故障恢复和相关组件故障对应用程序可靠性和性能的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号