...
首页> 外文期刊>Parallel and Distributed Systems, IEEE Transactions on >Resilient Self-Compressive Monitoring for Large-Scale Hosting Infrastructures
【24h】

Resilient Self-Compressive Monitoring for Large-Scale Hosting Infrastructures

机译:大型托管基础架构的弹性自压缩监控

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Large-scale hosting infrastructures have become the fundamental platforms for many real-world systems such as cloud computing infrastructures, enterprise data centers, and massive data processing systems. However, it is a challenging task to achieve both scalability and high precision while monitoring a large number of intranode and internode attributes (e.g., CPU usage, free memory, free disk, internode network delay). In this paper, we present the design and implementation of a Resilient self-Compressive Monitoring (RCM) system for large-scale hosting infrastructures. RCM achieves scalable distributed monitoring by performing online data compression to reduce remote data collection cost. RCM provides failure resilience to achieve robust monitoring for dynamic distributed systems where host and network failures are common. We have conducted extensive experiments using a set of real monitoring data from NCSU's virtual computing lab (VCL), PlanetLab, a Google cluster, and real Internet traffic matrices. The experimental results show that RCM can achieve up to 200 percent higher compression ratio and several orders of magnitude less overhead than the existing approaches.
机译:大型托管基础架构已成为许多现实世界系统(如云计算基础架构,企业数据中心和海量数据处理系统)的基本平台。但是,在监视大量节点内和节点间属性(例如,CPU使用率,可用内存,可用磁盘,节点间网络延迟)的同时实现可伸缩性和高精度是一项艰巨的任务。在本文中,我们介绍了用于大型托管基础架构的弹性自压缩监控(RCM)系统的设计和实现。 RCM通过执行在线数据压缩以减少远程数据收集成本来实现可扩展的分布式监视。 RCM提供故障恢复能力,以针对主机和网络故障常见的动态分布式系统实现强大的监控。我们使用来自NCSU虚拟计算实验室(VCL),PlanetLab,Google集群和真实Internet流量矩阵的一组真实监控数据进行了广泛的实验。实验结果表明,与现有方法相比,RCM可以实现高达200%的更高压缩比,且开销减少几个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号