Twenty-ninth International Conference on Very Large Databases; Sep 9-12, 2003; Berlin, Germany

Efficient Processing of Expressive Node-Selecting Queries on XML Data in Secondary Storage: A Tree Automata-based Approach


Abstract

We propose a new, highly scalable and efficient technique for evaluating node-selecting queries on XML trees which is based on recent advances in the theory of tree automata. Our query processing techniques require only two linear passes over the XML data on disk, and their main memory requirements are in principle independent of the size of the data. The overall running time is O(m + n), where m only depends on the query and n is the size of the data. The query language supported is very expressive and captures exactly all node-selecting queries answerable with only a bounded amount of memory (thus, all queries that can be answered by any form of finite-state system on XML trees). Visiting each tree node only twice is optimal, and current automata-based approaches to answering path queries on XML streams, which work using one linear scan of the stream, are considerably less expressive. These technical results - which give rise to expressive query engines that deal more efficiently with large amounts of data in secondary storage - are complemented with an experimental evaluation of our work.
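
To make the two-pass idea concrete, here is a minimal toy sketch. It is not the paper's algorithm, and it makes several assumptions of its own: the `Node` class, the helper names, and the hard-coded example query ("select every `a` node that has a `b` descendant and a `c` ancestor") are all hypothetical. It keeps the per-node flags in in-memory lists, whereas the abstract describes linear passes over data on disk with main memory use independent of the data size. The sketch only shows why one bottom-up pass followed by one top-down pass is enough to combine information from below and above each node.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Node:
    """Hypothetical in-memory stand-in for an XML element."""
    label: str
    children: List["Node"] = field(default_factory=list)


def preorder(root: Node) -> Tuple[List[Node], List[int]]:
    """Return the nodes in document order plus each node's parent index."""
    order: List[Node] = []
    parents: List[int] = []
    stack = [(root, -1)]
    while stack:
        node, parent = stack.pop()
        idx = len(order)
        order.append(node)
        parents.append(parent)
        # Push children in reverse so the first child is visited first.
        for child in reversed(node.children):
            stack.append((child, idx))
    return order, parents


def select_nodes(root: Node) -> List[int]:
    """Toy query: 'a' nodes with a 'b' descendant and a 'c' ancestor."""
    order, parents = preorder(root)
    n = len(order)

    # Pass 1 (bottom-up): scan in reverse document order, so every
    # descendant of a node is processed before the node itself.
    has_b_below = [False] * n
    for i in reversed(range(n)):
        p = parents[i]
        if p >= 0 and (order[i].label == "b" or has_b_below[i]):
            has_b_below[p] = True

    # Pass 2 (top-down): scan in document order, so every ancestor of a
    # node is processed before the node itself.
    has_c_above = [False] * n
    selected: List[int] = []
    for i in range(n):
        p = parents[i]
        if p >= 0:
            has_c_above[i] = order[p].label == "c" or has_c_above[p]
        if order[i].label == "a" and has_b_below[i] and has_c_above[i]:
            selected.append(i)
    return selected


# Document order: c(0), a(1), b(2), a(3), d(4), b(5)
doc = Node("c", [Node("a", [Node("b")]), Node("a", [Node("d")]), Node("b")])
print(select_nodes(doc))  # [1]
```

Each node is touched once per pass, and each pass carries only a constant-size flag per node. That is the intuition behind the "visit each node twice" claim, although the real technique uses tree automata and secondary storage rather than Python lists.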
