Venue: ACM SIGMOD international conference on Management of data

Tree logical classes for efficient evaluation of XQuery



Abstract

XML is widely praised for its flexibility in allowing repeated and missing sub-elements. However, this flexibility makes it challenging to develop a bulk algebra, which typically manipulates sets of objects with identical structure. A set of XML elements, say of type book, may have members that vary greatly in structure, e.g. in the number of author sub-elements. This kind of heterogeneity may permeate the entire document in a recursive fashion: e.g., different authors of the same or different book may in turn greatly vary in structure. Even when the document conforms to a schema, the flexible nature of schemas for XML still allows such significant variations in structure among elements in a collection. Bulk processing of such heterogeneous sets is problematic.

In this paper, we introduce the notion of logical classes (LC) of pattern tree nodes, and generalize the notion of pattern tree matching to handle node logical classes. This abstraction pays off significantly in allowing us to reason with an inherently heterogeneous collection of elements in a uniform, homogeneous way. Based on this, we define a Tree Logical Class (TLC) algebra that is capable of handling the heterogeneity arising in XML query processing, while avoiding redundant work. We present an algorithm to obtain a TLC algebra expression from an XQuery statement (for a large fragment of XQuery). We show how to implement the TLC algebra efficiently, introducing the nest-join as an important physical operator for XML query processing. We show that evaluation plans generated using the TLC algebra not only are simpler but also perform better than those generated by competing approaches. TLC is the algebra used in the Timber [8] system developed at the University of Michigan.
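To make the heterogeneity problem concrete, the following is a minimal Python sketch. It is not the TLC algebra and not Timber's nest-join; the sample document and the function names `flat_match` and `nest_match` are hypothetical, chosen only for illustration. It contrasts a flat match, which emits one tuple per (book, author) pair and silently drops books with no author, with a nested match, which keeps every book and groups its zero or more authors under it.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data: books with two, zero, and one author sub-elements.
DOC = """
<bib>
  <book><title>A</title><author>Ann</author><author>Bob</author></book>
  <book><title>B</title></book>
  <book><title>C</title><author>Cy</author></book>
</bib>
"""

def flat_match(root, parent_tag, child_tag):
    """One tuple per (parent, child) pair; parents without a child vanish."""
    return [(p, c) for p in root.iter(parent_tag) for c in p.findall(child_tag)]

def nest_match(root, parent_tag, child_tag):
    """One tuple per parent; all matching children are grouped in a list,
    which may be empty, so every parent has the same output shape."""
    return [(p, p.findall(child_tag)) for p in root.iter(parent_tag)]

root = ET.fromstring(DOC)
flat = flat_match(root, "book", "author")
nested = nest_match(root, "book", "author")

for book, authors in nested:
    print(book.findtext("title"), [a.text for a in authors])
```

In the nested result every book appears exactly once regardless of how many authors it has, which is the kind of uniform treatment of a heterogeneous collection that the abstract describes; how the paper's logical classes and nest-join achieve this is specified in the paper itself, not here.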
