...
首页> 外文期刊>Journal of Zhejiang University. Science, A >Efficient processing of ordered XML twig pattern matching based on extended Dewey
【24h】

Efficient processing of ordered XML twig pattern matching based on extended Dewey

机译:基于延伸杜威的有序XML Twig模式匹配的高效处理

获取原文

摘要

Finding all occurrences of a twig pattern is a core operation of extensible markup language (XML) query processing. Holistic twig join algorithms, which avoid a large number of intermediate results, represent the state-of-the-art algorithms. However, ordered XML twig join is mentioned rarely in the literature and previous algorithms developed in attempts to solve the problem of ordered twig pattern (OTP) matching have poor performance. In this paper, we first propose a novel children linked stacks encoding scheme to represent compactly the partial ordered twig join results. Based on this encoding scheme and extended Dewey, we design a novel holistic OTP matching algorithm, called OTJFast, which needs only to access the labels of the leaf query nodes. Furthermore, we propose a new algorithm, named OTJFaster, incorporating three effective optimization rules to avoid unnecessary computations. This works well on available indices (such as B+-tree), skipping useless elements. Thus, not only is disk access reduced greatly, but also many unnecessary computations are avoided. Finally, our extensive experiments over both real and synthetic datasets indicate that our algorithms are superior to previous approaches.
机译:查找枝条图案的所有出现是可扩展标记语言(XML)查询处理的核心操作。整体曲线加入算法,避免了大量中间结果,代表了最先进的算法。然而,很少在文献中提到有序的XML Twig连接,并在尝试解决有序的枝条图案(OTP)匹配问题时开发的先前算法具有差的性能。在本文中,我们首先提出了一种新颖的儿童链接堆栈编码方案,表示紧凑的部分有序的曲线连接结果。基于此编码方案和扩展杜威,我们设计了一种名为OTJFAST的新型全能OTP匹配算法,只需要访问叶查询节点的标签。此外,我们提出了一种名为Otjfaster的新算法,包含三种有效的优化规则,以避免不必要的计算。这适用于可用指数(例如B + -Tree),跳过无用的元素。因此,不仅磁盘访问很大,而且还避免了许多不必要的计算。最后,我们对真实和合成数据集的广泛实验表明我们的算法优于先前的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号