The role of structural aggregation for query processing over XML data.

Abstract

With the advent of XML as the basis for many data-centric applications, issues regarding the effective retrieval of XML data have become prevalent. In this context, XML query evaluation presents unique challenges, mainly because existing relational query algorithms cannot be applied directly to XML data, for several reasons: XML data conform to a tree format rather than a tabular one, do not follow a strict schema, and are typically textual with repetitive information. A number of data structures, known as structural summaries, have been defined to compensate for the repetition and lack of schema in XML data. So far, these summaries have been explored mainly as secondary indexes that can identify nodes reachable through specific path patterns. This dissertation shows that such summaries can also suggest new data clustering and partitioning policies that are very beneficial for XML processing. Even though this aspect has started to receive some attention, there is not yet a comprehensive study on using summaries as a data clustering technique or on their partitioning properties with respect to XML query processing. Furthermore, various questions regarding the behavior of structural summaries when processing both stored data and data streams remain open.

Therefore, this dissertation examines query processing over XML data by exploring and extending the role of the structural aggregation properties provided by the summaries. Specifically, it evaluates and proposes algorithms for processing path queries over the partitions defined by the summaries. It introduces how the summaries can be employed as access methods and discusses the advantages and drawbacks of using them in that role. It considers the typical query evaluation scenario of processing stored documents and returning the document nodes that satisfy a query (XPath semantics). Finally, it takes the role of structural aggregation one step further and introduces how the summaries can improve the performance of stream processing within the context of XML filtering. The overall objective is to show that structural aggregation methods can be employed efficiently in a variety of scenarios that are far more complex than traditional secondary path indexing.
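To make the idea of a summary-defined partition concrete, the following is a minimal sketch and not the dissertation's own algorithm. It assumes the simplest kind of structural summary, one that groups element nodes by their root-to-node label path, and it answers a descendant-style path query by selecting whole partitions instead of scanning every node. The names `build_path_summary` and `query_suffix`, and the sample document, are hypothetical and chosen only for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_path_summary(root):
    """Partition elements by root-to-node label path (an assumed, simple summary)."""
    summary = defaultdict(list)
    stack = [(root, "/" + root.tag)]
    while stack:
        node, path = stack.pop()
        summary[path].append(node)
        # Push children in reverse so they are visited in document order.
        for child in reversed(list(node)):
            stack.append((child, path + "/" + child.tag))
    return summary


def query_suffix(summary, labels):
    """Return nodes whose label path ends with `labels`, like the query //a/b.

    Only the summary's path keys are inspected; matching partitions are
    returned wholesale, without testing individual nodes.
    """
    result = []
    for path, nodes in summary.items():
        parts = path.split("/")[1:]
        if parts[-len(labels):] == labels:
            result.extend(nodes)
    return result


# Hypothetical sample document, for illustration only.
DOC = (
    "<lib>"
    "<book><title>A</title><author>X</author></book>"
    "<book><title>B</title></book>"
    "<article><title>C</title></article>"
    "</lib>"
)

summary = build_path_summary(ET.fromstring(DOC))
all_titles = sorted(n.text for n in query_suffix(summary, ["title"]))
book_titles = sorted(n.text for n in query_suffix(summary, ["book", "title"]))
```

In this sample the eight elements fall into six partitions, so a query such as `//title` compares its labels against six path keys and then returns the matching partitions in full. Richer summaries and queries with branching predicates would need more than this suffix match.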

Bibliographic details

  • Author

    Moro, Mirella Moura

  • Author affiliation

    University of California, Riverside

  • Degree-granting institution: University of California, Riverside
  • Subject: Computer Science
  • Degree: Ph.D.
  • Year: 2007
  • Pages: 160 p.
  • Total pages: 160
  • Original format: PDF
  • Language: English