首页> 外文会议>International Conference on Data Engineering >The Space Complexity of Processing XML Twig Queries Over Indexed Documents
【24h】

The Space Complexity of Processing XML Twig Queries Over Indexed Documents

机译:处理索引文档的XML Twig查询的空间复杂性

获取原文

摘要

Current twig join algorithms incur high memory costs on queries that involve child-axis nodes. In this paper we provide an analytical explanation for this phenomenon. In a first large-scale study of the space complexity of evaluating XPath queries over indexed XML documents we show the space to depend on three factors: (1) whether the query is a path or a tree; (2) the types of axes occurring in the query and their occurrence pattern; and (3) the mode of query evaluation (filtering, full-fledged, or "pattern matching"). Our lower bounds imply that evaluation of a large class of queries that have child-axis nodes indeed requires large space. Our study also reveals that on some queries there is a large gap between the space needed for pattern matching and the space needed for full-fledged evaluation or filtering. This implies that many existing twig join algorithms, which work in the pattern matching mode, incur significant space overhead. We present a new twig join algorithm that avoids this overhead. On certain queries our algorithm is exceedingly more space-efficient than existing algorithms, sometimes bringing the space down from linear in the document size to constant.
机译:当前的Twig加入算法在涉及子轴节点的查询时产生高的内存成本。在本文中,我们为这种现象提供了一个分析解释。在索引XML文档中评估XPath查询的第一个大规模研究,我们显示了依赖三个因素的空间:(1)查询是路径还是树; (2)查询中发生的轴的类型及其发生模式; (3)查询评估模式(过滤,全面崩溃或“模式匹配”)。我们的下限意味着评估具有子轴节点的大类查询,确实需要大空间。我们的研究还揭示了一些查询,模式匹配所需的空间与全面评估或过滤所需的空间之间存在巨大差距。这意味着许多现有的曲线连接算法,在模式匹配模式下工作,产生了高度的空间开销。我们提出了一种新的曲线连接算法,避免了这个开销。在某些查询中,我们的算法比现有算法更高的空间效率,有时将空间从文档大小中的线性从线性带到常量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号