首页> 外文会议>8th IEEE International Conference on e-Business Engineering >CREST: Towards Fast Speculation of Straggler Tasks in MapReduce
【24h】

CREST: Towards Fast Speculation of Straggler Tasks in MapReduce

机译:CREST:迈向MapReduce中的Straggler任务快速推测

获取原文

摘要

Data-Intensive Computing emerges as the fourth paradigm for modern scientific discoveries. MapReduce, a programming paradigm for large-scale data-parallel applications, is widely applied to web indexing, machine learning, and scientific simulations in industries as well as in academia. Recently, the virtualized "utility computing" environments, such as campus cloud, are becoming an important scenario to run MapReduce jobs. For a MapReduce job, the straggler tasks may dominate the response time and delay whole job. Various speculation schemes have been proposed to alleviate such problem, however, most of them implicitly assume that the time cost for data movement on launching speculative map tasks is trivial, which does not always hold for the virtualized Hadoop clusters in a campus cloud. In this paper, we propose a novel approach, CREST(Combination Re-Execution Scheduling Technology), which can achieve the optimal running time for speculative map tasks and decrease the response time of MapReduce jobs. The main idea is that re-executing a combination of tasks on a group of computing nodes may progress faster than directly speculating the straggler task on target node, due to data locality. The evaluation validates our approach and demonstrates that CREST can reduce the running time of a speculative map task by 70% with best cases and 50% on average, comparing with LATE.
机译:数据密集型计算成为现代科学发现的第四个范例。 MapReduce是用于大规模数据并行应用程序的编程范例,已广泛应用于工业和学术界中的Web索引,机器学习和科学模拟。最近,虚拟化的“实用程序计算”环境(例如校园云)正在成为运行MapReduce作业的重要方案。对于MapReduce作业,散布任务可能会控制响应时间并延迟整个作业。已经提出了各种推测方案来减轻这种问题,但是,大多数方案隐式地假设启动推测性地图任务时数据移动的时间成本是微不足道的,对于校园云中的虚拟化Hadoop集群而言,这并不总是适用的。在本文中,我们提出了一种新颖的方法CREST(组合重新执行调度技术),该方法可以实现推测性地图任务的最佳运行时间,并减少MapReduce作业的响应时间。主要思想是,由于数据的局部性,与直接在目标节点上推测散乱任务相比,在一组计算节点上重新执行任务组合的进度可能更快。该评估验证了我们的方法,并证明与LATE相比,CREST可以将最佳情况下的投机图任务的运行时间减少70%,平均减少50%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号