Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce

机译：使用MapReduce并行化结构化联接以处理基于XML的大XML数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Processing XML queries over big XML data using MapReduce has been studied in recent years. However, the existing works focus on partitioning XML documents and distributing XML fragments into different compute nodes. This attempt may introduce high overhead in XML fragment transferring from one node to another during MapReduce execution. Motivated by the structural join based XML query processing approach, which uses only related inverted lists to process queries in order to reduce I/O cost, we propose a novel technique to use MapReduce to distribute labels in inverted lists in a computing cluster, so that structural joins can be parallelly performed to process queries. We also propose an optimization technique to reduce the computing space in our framework, to improve the performance of query processing. Last, we conduct experiment to validate our algorithms.

机译：近年来，已经研究了使用MapReduce处理大型XML数据上的XML查询。但是，现有的工作集中在对XML文档进行分区以及将XML片段分布到不同的计算节点上。这种尝试可能会在MapReduce执行期间从一个节点到另一个节点的XML片段传输中引入高开销。基于基于结构连接的XML查询处理方法的动机，该方法仅使用相关的反向列表来处理查询以降低I / O成本，因此我们提出了一种新颖的技术，该方法使用MapReduce在计算集群中的反向列表中分配标签，从而可以并行执行结构化连接以处理查询。我们还提出了一种优化技术，以减少我们框架中的计算空间，以提高查询处理的性能。最后，我们进行实验以验证算法。

著录项

来源
《International conference on database and expert systems applications》|2014年|183-190|共8页
会议地点
作者
Huayu Wu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Grid-Based Parallel Algorithms of Join Queries for Analyzing Multi-Dimensional Data on MapReduce [J] . Miyoung JANG, Jae-Woo CHANG IEICE transactions on information and systems . 2018,第4期

机译：MapReduce上多维数据分析的基于网格的联合查询并行算法
2. A Study on Parallel Holistic Tbig Join for XML Query Processing [J] . Imam Machdi 情報処理 . 2011,第10期

机译：XML查询处理的并行整体Tbig Join研究
3. Handling distributed XML queries over large XML data based on MapReduce framework [J] . Fan Hongjie, Ma Zhiyi, Wang Dianhui, Information Sciences: An International Journal . 2018,第期

机译：根据MapReduce框架处理在大型XML数据上的分布式XML查询
4. Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce [C] . Huayu Wu International Conference on Database and Expert Systems Applications . 2014

机译：并行化结构连接以使用MapReduce对大XML数据进行查询
5. Query processing and optimization for structural selection queries over XML data. [D] . Vagena, Zografoula. 2005

机译：针对XML数据的结构选择查询的查询处理和优化。
6. Using XML Metadata to Enable the Automatic Generation and Processing of HTML Forms from XML Documents [O] . Anil K. Dubey, Henry C. Chueh 2001

机译：使用XML元数据启用从XML文档自动生成和处理HTML表单的功能
7. A study on parallel holistic twig joins for XML query processing [O] . Machdi Imam 2010

机译：XML查询处理的并行整体枝连接研究
8. Interactive Query Processing in Big Data Systems: A Cross Industry Study of MapReduce Workloads. [R] . R. H. Katz S. Alspaugh Y. Chen 2012

机译：大数据系统中的交互式查询处理：mapReduce工作负载的跨行业研究。

Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce

摘要

著录项

相似文献

相关主题

期刊订阅