首页> 外文期刊>Future generation computer systems >Cross-MapReduce: Data transfer reduction in geo-distributed MapReduce
【24h】

Cross-MapReduce: Data transfer reduction in geo-distributed MapReduce

机译:Cross-MapReduce:地理分布式MapReduce中的数据传输减少

获取原文
获取原文并翻译 | 示例
           

摘要

The MapReduce model is widely used to store and process big data in a distributed manner. MapReduce was originally developed for a single tightly coupled cluster of computers. Approaches such as Hierarchical and Geo-Hadoop are designed to address geo-distributed MapReduce processing. However, these methods still suffer from high inter-duster data transfer over the Internet, which is prohibitive for processing today's globally big data. In line with our thinking that there is no need to transfer the entire intermediate results to a single global reducer, we propose Cross-MapReduce, a framework for geo-distributed MapReduce processing. Before any massive data transfer, our proposed method finds a set of best global reducers to minimize transferred data volumes. We propose a graph called Global Reduction Graph (CRG) to determine the number and the locations of the global reducers. We conducted extensive experimental evaluations using a real testbed to demonstrate the effectiveness of Cross-MapReduce. The experimental results show that Cross-MapReduce significantly outperforms the Hierarchical and Geo-Hadoop approaches and reduces the amount of data transfer over the Internet by 40%.
机译:MapReduce模型广泛用于以分布式方式存储和处理大数据。 MapReduce最初是为单个紧密耦合的计算机组开发的。等级和Geo-Hadoop等方法旨在解决地理分布式MapReduce处理。然而,这些方法仍然遭受互联网的高帧间间数据传输,这对当今全球大数据进行了禁止。符合我们认为无需将整个中间结果转移到单个全局减速器,我们提出了Cross-MapReduce,是地理分布式MapReduce处理的框架。在任何大量数据传输之前,我们的提出方法都会找到一组最佳的全局减速器,以最大限度地减少传输的数据卷。我们提出了一个名为全局还原图(CRG)的图形,以确定全局减速器的数量和位置。我们使用真正的试验台进行了广泛的实验评估,以证明交叉映射的有效性。实验结果表明,交叉映射得明显优于等级和地理Hadoop方法,并将互联网上的数据传输量减少40%。

著录项

  • 来源
    《Future generation computer systems》 |2021年第2期|188-200|共13页
  • 作者单位

    Department of Computer Engineering Faculty of Engineering Ferdowsi University of Mashhad Mashhad Iran;

    Department of Computer Engineering Faculty of Engineering Ferdowsi University of Mashhad Mashhad Iran;

    Faculty of Information Technology Monash University Clayton Australia;

    Department of Computer Engineering Faculty of Engineering Ferdowsi University of Mashhad Mashhad Iran;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    MapReduce; Geo-distributed; Data center; Big data;

    机译:mapreduce;地理分布;数据中心;大数据;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号