...
首页> 外文期刊>Cloud Computing, IEEE Transactions on >Dependency-Aware Data Locality for MapReduce
【24h】

Dependency-Aware Data Locality for MapReduce

机译:MapReduce的依赖关系感知数据局部性

获取原文
获取原文并翻译 | 示例
           

摘要

MapReduce effectively partitions and distributes computation workloads to a cluster of servers, facilitating today’s big data processing. Given the massive data to be dispatched, and the intermediate results to be collected and aggregated, there have been a significant studies onndata localitynthat seeks to co-locate computation with data, so as to reduce cross-server traffic in MapReduce. They generally assume that the input data have little dependency with each other, which however is not necessarily true for that of many real-world applications, and we show strong evidence that the finishing time of MapReduce tasks can be greatly prolonged with such data dependency. In this paper, we present Dependency-Aware Locality for MapReduce (DALM) for processing the real-world input data that can be highly skewed and dependent. DALM accommodates data-dependency in a data-locality framework, organically synthesizing the key components from data reorganization, replication, placement. Beside algorithmic design within the framework, we have also closely examined the deployment challenges, particularly in public virtualized cloud environments, and have implemented DALM on Hadoop 1.2.1 with Giraph 1.0.0. Its performance has been evaluated through both simulations and real-world experiments, and compared with that of state-of-the-art solutions.
机译:MapReduce有效地将计算工作负载分区并分配到服务器集群中,以促进当今的大数据处理。鉴于要分发的海量数据,以及要收集和汇总的中间结果,对n 数据位置 n,该位置试图将计算与数据并置在一起,以减少MapReduce中的跨服务器流量。他们通常假设输入数据彼此之间几乎没有依赖性,但是对于许多实际应用程序并不一定如此,并且我们有力的证据表明,使用这种数据依赖性可以大大延长MapReduce任务的完成时间。在本文中,我们提出了MapReduce的依赖关系局部性(DALM),用于处理可能高度偏向和依赖的现实输入数据。 DALM在数据局部性框架中适应了数据依赖性,有机地综合了来自数据重组,复制和放置的关键组件。除了框架中的算法设计之外,我们还仔细研究了部署挑战,尤其是在公共虚拟化云环境中,并且已经在具有Giraph 1.0.0的Hadoop 1.2.1上实现了DALM。它的性能已通过仿真和实际实验进行了评估,并与最新解决方案进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号