首页> 外文学位 >Fast low-cost failure recovery for real-time communication in multi-hop networks.
【24h】

Fast low-cost failure recovery for real-time communication in multi-hop networks.

机译:快速的低成本故障恢复,可在多跳网络中进行实时通信。

获取原文
获取原文并翻译 | 示例

摘要

Best-effort communication is inadequate for QoS-sensitive applications (like multimedia), since such applications require bounded message delay and predictable throughput. Instead, real-time communication which can provide QoS guarantees by resource reservation has been actively researched. As the application domain of real-time communication expands to include business- or mission-critical applications, network dependability becomes essential.; This dissertation addresses how to make real-time communication dependable. We have developed an integrated scheme for restoring real-time connections from network component failures. As applications with different dependability requirements share the same network, the dependability level and its associated cost should be flexibly chosen depending on the criticality of applications. Our scheme is based on five key design principles: per-connection dependability guarantee, fast failure recovery, small fault-tolerance overhead, robust failure handling, and high interoperability and scalability.; To quickly restore failed connections, cold-standby backup channels are set up in advance along with each primary channel. Upon failure of a primary channel, one of its backups is promoted to replace the primary channel. To minimize the resource overhead in maintaining backup channels, resources for backups are shared judiciously so that connection dependability may not be compromised. By choosing the degree of resource sharing and the number of backups, the network can control the dependability of a connection in accordance with the application's request.; Our scheme covers all aspects of connection failure recovery such as backup routing, failure detection, channel switching, and resource reconfiguration after failure recovery. Particularly, we develop two behavior-based failure-detection schemes that do not require any special hardware support, and experimentally evaluate their effectiveness using a testbed implementation. We also develop a novel protocol that provides distributed and robust handling of detected failures. Good coverage in recovering from failures is shown to be achievable with low degradation in network utilization under reasonable failure conditions.; Our distributed architecture scales well, and the procedures of backup establishment, failure detection, and channel switching are independent of the underlying communication system so that our scheme is interoperable with various real-time communication schemes.
机译:对于QoS敏感的应用程序(如多媒体),尽力而为的通信是不够的,因为此类应用程序需要有限的消息延迟和可预测的吞吐量。取而代之的是,已经积极研究了可以通过资源预留来提供QoS保证的实时通信。随着实时通信的应用领域扩展到包括业务或任务关键型应用程序,网络可靠性变得至关重要。本文探讨了如何使实时通信可靠。我们已开发出一种用于从网络组件故障中恢复实时连接的集成方案。由于具有不同可靠性要求的应用程序共享同一网络,因此应根据应用程序的关键程度灵活选择可靠性级别及其相关成本。我们的方案基于五个关键设计原则:每个连接的可靠性保证,快速的故障恢复,较小的容错开销,可靠的故障处理以及较高的互操作性和可伸缩性。为了快速恢复失败的连接,预先与每个主通道一起设置了冷备用备份通道。主通道发生故障时,将升级其备份之一以替换主通道。为了最大程度地减少维护备份通道的资源开销,请谨慎共享备份资源,以免影响连接可靠性。通过选择资源共享程度和备份数量,网络可以根据应用程序的请求来控制连接的可靠性。我们的方案涵盖了连接故障恢复的所有方面,例如备份路由,故障检测,通道切换以及故障恢复后的资源重新配置。特别是,我们开发了两个不需要任何特殊硬件支持的基于行为的故障检测方案,并使用测试平台实现对它们的有效性进行了实验评估。我们还开发了一种新颖的协议,该协议可对检测到的故障进行分布式且强大的处理。事实证明,在合理的故障条件下,通过良好的覆盖范围可以实现故障恢复中的网络利用率低的降低。我们的分布式体系结构可以很好地扩展,并且备份建立,故障检测和通道切换的过程与底层通信系统无关,因此我们的方案可与各种实时通信方案互操作。

著录项

  • 作者

    Han, Seungjae.;

  • 作者单位

    University of Michigan.;

  • 授予单位 University of Michigan.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1998
  • 页码 135 p.
  • 总页数 135
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号