Conference: Database and Expert Systems Applications
Prefix Path Streaming: A New Clustering Method for Optimal Holistic XML Twig Pattern Matching

Abstract

Searching for all occurrences of a twig pattern in an XML document is an important operation in XML query processing. Recently, a class of holistic twig pattern matching algorithms has been proposed. Compared with prior approaches, the holistic method avoids generating large intermediate results that do not contribute to the final answer. The method is CPU and I/O optimal when twig patterns have only ancestor-descendant relationships.

The holistic twig pattern matching method proposed earlier operates on element streams that cluster all XML elements with the same tag name together. In this paper, we introduce a clustering method called Prefix Path Streaming (PPS) and new holistic twig pattern matching algorithms based on PPS. PPS clusters the elements of an XML document according to the paths from the root to the elements. This clustering approach avoids unnecessary scanning of irrelevant portions of XML documents.

More importantly, we develop optimal algorithms based on PPS that can process a large class of twig patterns consisting of both ancestor-descendant and parent-child relationships.
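
The difference between the two clustering schemes can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the sample document, the preorder numbering, and the helper names `build_streams` and `streams_ending_with` are assumptions made for illustration only. It shows that tag streaming places every `title` element in one stream, while prefix-path streaming splits them by root-to-element path, so a pattern with a parent-child edge such as `book/title` needs to read only the streams whose path ends in `book`, `title`.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical sample document, used only for this illustration.
DOC = """<bib>
  <book><title>T1</title><section><title>S1</title></section></book>
  <book><title>T2</title></book>
  <article><title>A1</title></article>
</bib>"""


def build_streams(xml_text):
    """Return (tag_streams, pps_streams); each stream lists preorder positions."""
    root = ET.fromstring(xml_text)
    tag_streams = defaultdict(list)   # clustered by tag name
    pps_streams = defaultdict(list)   # clustered by root-to-element label path
    counter = 0

    def visit(node, path):
        nonlocal counter
        counter += 1
        pos = counter                 # document-order (preorder) position
        path = path + (node.tag,)
        tag_streams[node.tag].append(pos)
        pps_streams[path].append(pos)
        for child in node:
            visit(child, path)

    visit(root, ())
    return dict(tag_streams), dict(pps_streams)


def streams_ending_with(pps_streams, suffix):
    """Keep only the prefix-path streams whose path ends with the given labels."""
    suffix = tuple(suffix)
    return {p: s for p, s in pps_streams.items() if p[-len(suffix):] == suffix}


tag_streams, pps_streams = build_streams(DOC)
relevant = streams_ending_with(pps_streams, ("book", "title"))

print(tag_streams["title"])  # all four title elements share one tag stream
print(relevant)              # only titles that are direct children of a book
```

In this sketch, the tag stream for `title` holds four elements, whereas the single relevant prefix-path stream holds two, so the titles under `section` and `article` would never be scanned for this pattern.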