【24h】

Conch: A Cyclic MapReduce Model for Iterative Applications

机译:海螺:用于迭代应用程序的循环MapReduce模型

获取原文

摘要

MapReduce programming model is a popular model to simplify but speed up data parallel applications. However, it is not efficient for iterative applications because of its repeated data transmission with HDFS (Hadoop Distributed File System). Conch, a cyclic MapReduce model, is designed for efficient processing of iterative applications. In order to minimize network overhead, shared data is cached locally and a "map-shuffle" phase is presented with a combined transmission mechanism. Meanwhile, a prediction scheduler for iterative applications is brought out to achieve better data locality in terms of runtime information. The experiments show that Conch can support iterative applications transparently and efficiently. Compared with Hadoop and HaLoop in single-job environment, Conch can achieve 13%-17% improvements on K-Means and fuzzy C-Means. Especially in multi-job environment, 63.6% and 28.6% improvements can be obtained compared with Hadoop and HaLoop.
机译:MapReduce编程模型是一种流行的模型,可以简化但加快数据并行应用程序的速度。但是,由于它与HDFS(Hadoop分布式文件系统)重复进行数据传输,因此对于迭代应用程序而言效率不高。 Conch是一个循环MapReduce模型,旨在有效处理迭代应用程序。为了最小化网络开销,共享数据在本地缓存,并通过组合的传输机制呈现“映射混洗”阶段。同时,针对运行时信息,推出了用于迭代应用程序的预测调度程序,以实现更好的数据局部性。实验表明,Conch可以透明有效地支持迭代应用程序。与单作业环境中的Hadoop和HaLoop相比,Conch在K均值和模糊C均值方面可实现13%-17%的改进。特别是在多作业环境中,与Hadoop和HaLoop相比,可以分别提高63.6%和28.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号