首页> 外文会议>International conference on database and expert systems applications >Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce
【24h】

Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce

机译:使用MapReduce并行化结构化联接以处理基于XML的大XML数据

获取原文

摘要

Processing XML queries over big XML data using MapReduce has been studied in recent years. However, the existing works focus on partitioning XML documents and distributing XML fragments into different compute nodes. This attempt may introduce high overhead in XML fragment transferring from one node to another during MapReduce execution. Motivated by the structural join based XML query processing approach, which uses only related inverted lists to process queries in order to reduce I/O cost, we propose a novel technique to use MapReduce to distribute labels in inverted lists in a computing cluster, so that structural joins can be parallelly performed to process queries. We also propose an optimization technique to reduce the computing space in our framework, to improve the performance of query processing. Last, we conduct experiment to validate our algorithms.
机译:近年来,已经研究了使用MapReduce处理大型XML数据上的XML查询。但是,现有的工作集中在对XML文档进行分区以及将XML片段分布到不同的计算节点上。这种尝试可能会在MapReduce执行期间从一个节点到另一个节点的XML片段传输中引入高开销。基于基于结构连接的XML查询处理方法的动机,该方法仅使用相关的反向列表来处理查询以降低I / O成本,因此我们提出了一种新颖的技术,该方法使用MapReduce在计算集群中的反向列表中分配标签,从而可以并行执行结构化连接以处理查询。我们还提出了一种优化技术,以减少我们框架中的计算空间,以提高查询处理的性能。最后,我们进行实验以验证算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号