Indexing and querying XML structured documents with XPath queries

Abstract

With the growing importance of XML for data representation and exchange in large-scale scientific and commercial applications, substantial research effort has gone into efficient indexing and querying mechanisms for retrieving data from XML documents. However, traditional sequence-based indexing and querying mechanisms suffer from three shortcomings. (i) The semantic flaw problem: an XML document that semantically satisfies an XML query may not be returned. (ii) The false answer (false alarm) problem: documents that do not satisfy a query may be wrongly returned, so an additional, time-consuming refinement or post-processing phase is needed to eliminate the false answers. (iii) Traditional sequence-based mechanisms cannot guarantee linear size for structure-encoded sequences; in the worst case, their space complexity reaches O(n²), where n is the total number of nodes in the XML data tree. To overcome these shortcomings, this patent application presents an efficient and novel geometric sequence mechanism, which transforms XML documents and XPath queries into corresponding geometric data and query sequences. XML querying is thus converted into finding non-contiguous geometric subsequence matches. The approach ensures correct (i.e., free of semantic flaws and false answers) and fast (i.e., without the costly post-processing phase) evaluation of XPath queries, while guaranteeing linear space complexity.
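The false answer problem and the quadratic worst case described in the abstract can be illustrated with a small sketch. The abstract does not say how the traditional structure-encoded sequences are built, so the encoding below is an assumption: each node is written as a pair of its tag and its root-to-parent path, in preorder. The function names, the sample documents, and the size measure are likewise hypothetical, and the sketch does not attempt to reproduce the geometric mechanism of the application itself.

```python
import xml.etree.ElementTree as ET

def structure_encoded_sequence(xml_text):
    """Preorder list of (tag, root-to-parent path) pairs (assumed encoding)."""
    seq = []
    def visit(node, prefix):
        seq.append((node.tag, prefix))
        for child in node:
            visit(child, prefix + (node.tag,))
    visit(ET.fromstring(xml_text), ())
    return seq

def is_subsequence(query, data):
    """Non-contiguous subsequence match: query items appear in data, in order."""
    it = iter(data)
    return all(item in it for item in query)

def tree_matches(qnode, dnode):
    """Ground truth for child-axis queries: every query child must be matched
    by some child of the SAME document node."""
    return qnode.tag == dnode.tag and all(
        any(tree_matches(qc, dc) for dc in dnode) for qc in qnode
    )

def encoded_size(seq):
    """Total number of tags stored, counting each node's path prefix."""
    return sum(1 + len(prefix) for _, prefix in seq)

# Document: P has two Q children, one holding T and the other holding S.
DOC = "<P><Q><T/></Q><Q><S/></Q></P>"
# Query P/Q[T][S]: a single Q that has both a T child and an S child.
QUERY = "<P><Q><T/><S/></Q></P>"

# Sequence matching reports a hit (True) ...
false_alarm = is_subsequence(
    structure_encoded_sequence(QUERY), structure_encoded_sequence(DOC)
)
# ... although no single Q in the document has both children (False).
truly_matches = tree_matches(ET.fromstring(QUERY), ET.fromstring(DOC))

# A chain of 4 nested nodes stores 4 tags plus prefixes of length 0+1+2+3.
chain_size = encoded_size(structure_encoded_sequence("<a><b><c><d/></c></b></a>"))
```

Under this assumed encoding, the subsequence test accepts the document because T and S are matched under two different Q nodes, which is the kind of false answer that a post-processing phase would have to filter out. The chain example shows why the size can grow quadratically: a path of n nodes stores prefixes of total length n(n-1)/2.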
Bibliographic details

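  • Publication number: GB2417799A

  • Publication date: 2006-03-08

  • Original format: PDF

  • Applicant/patentee: YUN SHEN; LING FENG

  • Application number: GB20040019771

  • Inventors: YUN SHEN; LING FENG

  • Filing date: 2004-09-07

  • Classification: G06F17/30; G06F17/22

  • Country: GB

  • Date added to database: 2022-08-21 21:16:31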
