首页> 外文会议>International University Communication Symposium >Optimization of multi-join query processing within MapReduce
【24h】

Optimization of multi-join query processing within MapReduce

机译:MapReduce中多连接查询处理的优化

获取原文

摘要

MapReduce is a programming model which is usually applied to process large-scale data. Many tasks can be implemented under the framework, such as data processing of search engines and machine learning. However, there is no efficient support for join operation in current implementations of MapReduce. Former work has studied Map-Reduce-Merge for join operator, however, because of the time cost in the Reduce phase, we argue it is better to omit the Reduce procedure along with the cost it brings for join implementation. In this paper, we design and implement a join algorithm on relational data in a MapReduce environment. Meanwhile, we present a method for join operator over many relations. We conduct a series of experiments to verify the effectiveness and efficiency of proposed methods.
机译:MapReduce是一个编程模型,通常应用于处理大规模数据。许多任务可以在框架下实现,例如搜索引擎和机器学习的数据处理。但是,在MapReduce的当前实现中没有有效支持加入操作。以前的工作已经研究了加入操作员的地图减少合并,但是,由于降低阶段的时间成本,我们认为更好的是省略减少程序以及它带来加入实现的成本。在本文中,我们在MapReduce环境中设计和实现了一种关于关系数据的连接算法。同时,我们在许多关系中提出了一种加入操作员的方法。我们进行一系列实验来验证所提出的方法的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号