...
首页> 外文期刊>Network, IEEE >Aggregation on the fly: reducing traffic for big data in the cloud
【24h】

Aggregation on the fly: reducing traffic for big data in the cloud

机译:实时聚合:减少云中大数据的流量

获取原文
获取原文并翻译 | 示例

摘要

As a leading framework for processing and analyzing big data, MapReduce is leveraged by many enterprises to parallelize their data processing on distributed computing systems. Unfortunately, the all-to-all data forwarding from map tasks to reduce tasks in the traditional MapReduce framework would generate a large amount of network traffic. The fact that the intermediate data generated by map tasks can be combined with significant traffic reduction in many applications motivates us to propose a data aggregation scheme for MapReduce jobs in cloud. Specifically, we design an aggregation architecture under the existing MapReduce framework with the objective of minimizing the data traffic during the shuffle phase, in which aggregators can reside anywhere in the cloud. Some experimental results also show that our proposal outperforms existing work by reducing the network traffic significantly.
机译:作为处理和分析大数据的领先框架,许多企业利用MapReduce在分布式计算系统上并行化其数据处理。不幸的是,从地图任务进行全部数据转发以减少传统MapReduce框架中的任务会产生大量的网络流量。由地图任务生成的中间数据可以与许多应用程序中的大量流量减少相结合的事实促使我们为云中的MapReduce作业提出一种数据聚合方案。具体来说,我们在现有的MapReduce框架下设计了一个聚合体系结构,其目的是在混洗阶段将聚合器可以驻留在云中的任何位置,从而将数据流量最小化。一些实验结果还表明,我们的建议通过显着减少网络流量来胜过现有工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号