首页> 外文学位 >Structural join: Processing algorithms and size estimation.
【24h】

Structural join: Processing algorithms and size estimation.

机译:结构连接:处理算法和大小估计。

获取原文
获取原文并翻译 | 示例

摘要

This dissertation is about developing advanced query processing and estimation techniques for database systems managing XML data. More specifically, an important operator in such systems, structural join, is studied. The following two issues related to the current trends in database research are addressed: efficient processing structural joins and estimating the result size of a structural join.; Extensible Markup Language (XML) has become the de facto standard for data representation and exchange over the Internet. It has enabled and stimulated a great multitude of research and applications. However, unique features of XML and its query languages have posed great challenges to efficient managing and querying large volumes of XML data. This, in turn, hinders progress of XML research and the development of applications based on XML. This dissertation makes the attempt to enhance the efficiency of XML database management system by advanced query processing and optimization techniques.; We first consider the efficient processing of structural joins for XML data. A structural join takes two sets of XML nodes as input and returns pairs of nodes such that a special ancestor-descendant relationship holds between them. Structural join is widely accepted as an core operator in XML query processing. An efficient and robust structural query processing framework based on a novel coding scheme, PBiTree coding, is proposed. The PBiTree code enables efficient checking of the ancestor-descendant relationship between two nodes solely based on their PBiTree codes. We present algorithms in the framework that are optimized for various combinations of physical settings. In particular, the newly proposed partitioning based algorithms can process structural joins efficiently without sorting or indexing. Experimental results demonstrate that the structural join processing algorithms based on the proposed coding scheme outperform existing algorithms significantly.; Next, we study the result size estimation problem of structural joins. Estimating the size of structural join result is essential to generating efficient XML query processing plans in an XML query optimizer. We propose two models, the interval model and the position model, under which the original estimation problem can be converted into estimating the size of a spatial join and a relational equijoin respectively. A set of estimation methods based on the histogram and sampling techniques are developed, which have not only high accuracy but also theoretical guarantees on the estimation. Comprehensive performance studies are conducted. The results demonstrate that the accuracy and robustness of our newly proposed estimation methods outperforms those of the previous method up to an order of magnitude.
机译:本文旨在为数据库管理XML数据开发先进的查询处理和估计技术。更具体地说,研究了这种系统中的重要算子,即结构连接。解决了与数据库研究的当前趋势相关的以下两个问题:有效处理结构连接和估计结构连接的结果大小。可扩展标记语言(XML)已成为Internet上数据表示和交换的事实上的标准。它已启用并激发了大量的研究和应用。但是,XML及其查询语言的独特功能对有效管理和查询大量XML数据提出了巨大挑战。反过来,这阻碍了XML研究的进展以及基于XML的应用程序的开发。本文试图通过先进的查询处理和优化技术来提高XML数据库管理系统的效率。我们首先考虑有效处理XML数据的结构化连接。结构化联接将两组XML节点作为输入,并返回节点对,以使它们之间保持特殊的祖先后代关系。结构化联接被广泛接受为XML查询处理中的核心运算符。提出了一种基于新型编码方案PBiTree编码的高效健壮的结构查询处理框架。 PBiTree代码仅基于两个节点的PBiTree代码就可以有效检查两个节点之间的祖先后代关系。我们在框架中介绍了针对物理设置的各种组合进行了优化的算法。特别是,新提出的基于分区的算法可以有效地处理结构连接,而无需进行排序或索引。实验结果表明,基于所提出的编码方案的结构连接处理算法明显优于现有算法。接下来,我们研究结构连接的结果大小估计问题。估计结构连接结果的大小对于在XML查询优化器中生成有效的XML查询处理计划至关重要。我们提出了两个模型,区间模型和位置模型,根据它们可以将原始估计问题分别转换为估计空间连接和关系等连接的大小。提出了一种基于直方图和采样技术的估计方法,不仅具有较高的精度,而且在估计上具有理论上的保证。进行了综合性能研究。结果表明,我们新提出的估计方法的准确性和鲁棒性要比以前的方法高一个数量级。

著录项

  • 作者

    Wang, Wei.;

  • 作者单位

    Hong Kong University of Science and Technology (People's Republic of China).;

  • 授予单位 Hong Kong University of Science and Technology (People's Republic of China).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2004
  • 页码 120 p.
  • 总页数 120
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号