首页> 外文期刊>IEEE Journal on Selected Areas in Communications >Greener Data Exchange in the Cloud: A Coding-Based Optimization for Big Data Processing
【24h】

Greener Data Exchange in the Cloud: A Coding-Based Optimization for Big Data Processing

机译:云中的绿色数据交换:基于编码的大数据处理优化

获取原文
获取原文并翻译 | 示例
           

摘要

The rise of the cloud and distributed data-intensive (big data) applications puts pressure on data center networks due to the movement of massive volumes of data. Reducing the volume of communication is pivotal for embracing greener data exchange by efficient utilization of network resources. This paper proposes the use of mixing technique, , working in tandem with software-defined network control as a means of dynamically-controlled reduction in volume of communication. We introduce motivating real-world use-cases, and present a novel algorithm for the data center networks. We also analyze the computational complexity of the general problem of minimizing the volume of communication in a distributed data center application without degrading the rate of information exchange, and provide theoretical limits of such schemes. Moreover, we proceed to bridge the gap between theory and practice by performing a proof-of-concept implementation of the proposed system in a real world data center. We use Hadoop MapReduce, the most widely used big data processing framework, as our target. The experimental results employing two of industry standard benchmarks show the advantage of our proposed system compared to a , an , and . The proposed coding-based scheme shows performance improvement in terms of volume of communication (up to 62%), goodput (up to 76%), disk utilization (up to 38%), and the number of bits that can be transmitted per Joule of energy (up to 200%).
机译:由于海量数据的移动,云和分布式数据密集型(大数据)应用程序的兴起给数据中心网络带来了压力。减少通信量对于通过有效利用网络资源来拥抱绿色数据交换至关重要。本文提出了使用混合技术,与软件定义的网络控制协同工作,以动态控制通信量减少的方法。我们介绍了激励现实的用例,并提出了一种用于数据中心网络的新颖算法。我们还分析了在不降低信息交换速率的情况下使分布式数据中心应用程序中的通信量最小化这一普遍问题的计算复杂性,并提供了此类方案的理论限制。此外,我们通过在现实世界的数据中心中对所提出的系统进行概念验证实施,来弥合理论与实践之间的鸿沟。我们使用最广泛使用的大数据处理框架Hadoop MapReduce作为我们的目标。使用两个行业标准基准的实验结果表明,与a,an和相比,我们提出的系统具有优势。提议的基于编码的方案在通信量(高达62%),吞吐量(高达76%),磁盘利用率(高达38%)以及每焦耳可以传输的位数方面显示出性能的提高。的能量(高达200%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号