Conference: ACM SIGMOD International Conference on Management of Data

Tree logical classes for efficient evaluation of XQuery

Abstract

XML is widely praised for its flexibility in allowing repeated and missing sub-elements. However, this flexibility makes it challenging to develop a bulk algebra, which typically manipulates sets of objects with identical structure. A set of XML elements, say of type book, may have members that vary greatly in structure, e.g. in the number of author sub-elements. This kind of heterogeneity may permeate the entire document in a recursive fashion: e.g., different authors of the same or different book may in turn greatly vary in structure. Even when the document conforms to a schema, the flexible nature of schemas for XML still allows such significant variations in structure among elements in a collection. Bulk processing of such heterogeneous sets is problematic.

In this paper, we introduce the notion of logical classes (LC) of pattern tree nodes, and generalize the notion of pattern tree matching to handle node logical classes. This abstraction pays off significantly in allowing us to reason with an inherently heterogeneous collection of elements in a uniform, homogeneous way. Based on this, we define a Tree Logical Class (TLC) algebra that is capable of handling the heterogeneity arising in XML query processing, while avoiding redundant work. We present an algorithm to obtain a TLC algebra expression from an XQuery statement (for a large fragment of XQuery). We show how to implement the TLC algebra efficiently, introducing the nest-join as an important physical operator for XML query processing. We show that evaluation plans generated using the TLC algebra not only are simpler but also perform better than those generated by competing approaches. TLC is the algebra used in the Timber [8] system developed at the University of Michigan.
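The abstract names the nest-join as a physical operator but does not spell out its behavior. As a rough illustration only, the sketch below shows the general idea of a nest-join as it is commonly understood: each outer item is paired once with the full (possibly empty) list of its matching inner items, so books with zero, one, or many authors are handled uniformly. The function `nest_join`, the dictionary layout, and the sample data are all hypothetical and are not taken from the paper or from the Timber system.

```python
from collections import defaultdict

def nest_join(outer, inner, outer_key, inner_key):
    """Pair each outer item with the list of ALL matching inner items.

    Hypothetical helper for illustration; not the paper's implementation.
    """
    # Group the inner items by their join key in one pass.
    groups = defaultdict(list)
    for item in inner:
        groups[inner_key(item)].append(item)
    # Each outer item appears exactly once; its match list may be empty.
    return [(o, groups.get(outer_key(o), [])) for o in outer]

# Assumed sample data: books with a varying number of authors.
books = [{"id": 1, "title": "A"}, {"id": 2, "title": "B"}, {"id": 3, "title": "C"}]
authors = [
    {"book": 1, "name": "X"},
    {"book": 1, "name": "Y"},
    {"book": 3, "name": "Z"},
]

result = nest_join(books, authors, lambda b: b["id"], lambda a: a["book"])
for book, matched in result:
    print(book["title"], [a["name"] for a in matched])
```

Unlike a flat join, which would emit one row per (book, author) pair and drop the book without authors, this form keeps one result per book and nests the variable-length author list inside it.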