首页> 外文会议>IEEE International Symposium on Real-Time Distributed Computing >Tolerating Transient Late-Timing Faults in Cloud-Based Real-Time Stream Processing
【24h】

Tolerating Transient Late-Timing Faults in Cloud-Based Real-Time Stream Processing

机译:容忍基于云的实时流处理中的瞬时后期错误

获取原文

摘要

Real-time stream processing is a frequently deployed application within Cloud datacenters that is required to provision high levels of performance and reliability. Numerous fault-tolerant approaches have been proposed to effectively achieve this objective in the presence of crash failures. However, such systems struggle with transient late-timing faults - a fault classification challenging to effectively tolerate - that manifests increasingly within large-scale distributed systems. Such faults represent a significant threat towards minimizing soft real-time execution of streaming applications in the presence of failures. This work proposes a fault-tolerant approach for QoS-aware data prediction to tolerate transient late-timing faults. The approach is capable of determining the most effective data prediction algorithm for imposed QoS constraints on a failed stream processor at run-time. We integrated our approach into Apache Storm with experiment results showing its ability to minimize stream processor end-to-end execution time by 61% compared to other fault-tolerant approaches. The approach incurs 12% additional CPU utilization while reducing network usage by 44%.
机译:实时流处理是Cloud数据中心内经常部署的应用程序,需要提供高水平的性能和可靠性。已经提出了许多容错方法,以在发生碰撞故障时有效地实现该目的。但是,这样的系统在瞬态后期定时故障(一种难以有效容忍的故障分类)中挣扎,这种故障在大型分布式系统中越来越多地表现出来。此类故障对在出现故障的情况下最小化流应用程序的软实时执行构成了重大威胁。这项工作提出了一种用于QoS感知数据预测的容错方法,以容忍瞬时后定时故障。该方法能够在运行时确定对故障流处理器施加的QoS约束的最有效数据预测算法。我们通过实验结果将我们的方法集成到了Apache Storm中,实验结果表明,与其他容错方法相比,该方法能够将流处理器的端到端执行时间最小化61%。该方法可将CPU使用率提高12%,同时将网络使用率降低44%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号