首页> 外文学位 >Efficient structural join processing algorithms.
【24h】

Efficient structural join processing algorithms.

机译:高效的结构连接处理算法。

获取原文
获取原文并翻译 | 示例

摘要

Extensible Markup Language (XML) has become the de facto standard for data representation, exchange and publishing over the Internet. As more and more XML data is gathered and processed by database management systems, we need innovative approaches to efficiently manage and query it. However, XML queries possess some unique features, which distinguish them from the traditional RDBMS queries and pose a challenge to the XML research community. This dissertation investigates the problem of how to improve the XML path expression performance by considering the unique features of path expression, since path expression is the core part of XML queries. By fully exploiting these features, we are able to propose some novel algorithms to significantly improve XML path expression performance.; We first investigate the issue of optimizing XML path expressions without considering the workload. A basic kind of XML path expression is the structural query, which retrieves all descendants of a specific XML element. Structural queries are important because they constitute the building block of various complex XML path expressions. Among proposed XML query processing techniques for structural queries, structural join outperforms most other graph-traversal based approaches as it minimizes the number of nodes accessed to evaluate a structural query. In the first part of this dissertation we show that general path expressions, involving conditions on attributes, possess inherent multi-dimensional characteristics that are not captured by existing XML query processing methods based on 1D ordering. Motivated by this, we index XML data with multi-dimensional access methods and develop efficient algorithms to minimize the query cost. Extensive experimental evaluations confirm that our algorithms provide significant performance gains for numerous query types.; Our next goal is to further improve the path expression performance by considering the query workload. All existing indices for structural join focus on indexing the encoding information of XML elements so as to improve the XML path expression performance. Consequently, such indices utilize only data characteristics, but ignore query characteristics that are important for the further improvement of the query performance. In this thesis, we propose AC-tree (Adaptive Cluster B+-tree), which is a fully workload-aware structural join index, to provide a simple, but efficient way to exploit the XML query characteristics for performance improvement. To the best of our knowledge, AC-tree is the first structural join index to take the workload into consideration. An extensive set of experiments confirm that AC-tree outperforms competitors significantly for both simple (i.e., non-branching) XML path expressions and branching path expressions.
机译:可扩展标记语言(XML)已成为Internet上数据表示,交换和发布的事实上的标准。随着数据库管理系统收集和处理越来越多的XML数据,我们需要创新的方法来有效地管理和查询它。但是,XML查询具有一些独特的功能,这使它们与传统的RDBMS查询区分开来,并给XML研究社区带来了挑战。由于路径表达式是XML查询的核心部分,因此本文通过考虑路径表达式的独特性来研究如何提高XML路径表达式的性能。通过充分利用这些功能,我们可以提出一些新颖的算法来显着提高XML路径表达的性能。我们首先研究不考虑工作负载而优化XML路径表达式的问题。 XML路径表达式的一种基本类型是结构查询,它检索特定XML元素的所有后代。结构化查询很重要,因为它们构成了各种复杂的XML路径表达式的基础。在针对结构化查询提出的XML查询处理技术中,结构化连接的性能优于大多数其他基于图遍历的方法,因为它可以最小化为评估结构化查询而访问的节点数量。在本文的第一部分中,我们展示了涉及属性条件的通用路径表达式具有固有的多维特性,这些特性是现有的基于一维排序的XML查询处理方法无法捕获的。因此,我们使用多维访问方法为XML数据建立索引,并开发有效的算法以最小化查询成本。大量的实验评估证实,我们的算法可为多种查询类型提供显着的性能提升。我们的下一个目标是通过考虑查询工作负载来进一步提高路径表达性能。现有的所有用于结构连接的索引都集中在索引XML元素的编码信息上,以提高XML路径表达的性能。因此,这样的索引仅利用数据特征,而忽略对于进一步改善查询性能很重要的查询特征。在本文中,我们提出了AC树(自适应集群B +树),它是一种完全了解工作负载的结构连接索引,它提供了一种简单而有效的方法来利用XML查询特性来提高性能。据我们所知,AC-tree是第一个考虑工作量的结构化连接索引。大量的实验证实,对于简单(即非分支)的XML路径表达式和分支路径表达式,AC树都大大优于竞争对手。

著录项

  • 作者

    Liu, Kaiyang.;

  • 作者单位

    Hong Kong University of Science and Technology (People's Republic of China).;

  • 授予单位 Hong Kong University of Science and Technology (People's Republic of China).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 84 p.
  • 总页数 84
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号