...
首页> 外文期刊>Journal of Information Science >A stream-based method to detect differences between XML documents
【24h】

A stream-based method to detect differences between XML documents

机译:基于流的方法来检测XML文档之间的差异

获取原文
获取原文并翻译 | 示例

摘要

Detecting differences between XML documents is one of most important research topics for XML. Since XML documents are generally considered to be organized in a tree structure, most previous research has attempted to detect differences using tree-matching algorithms. However, most tree-matching algorithms have inadequate performance owing to limitations in terms of the execution time, optimality and scalability. This study proposes a stream-based difference detection method in which an XML binary encoding algorithm is used to provide improved performance relative to that of previous tree-matching algorithms. A tree-structured analysis of XML is not essential in order to detect differences. We use a O-Path algorithm that has an optimal result quality for difference detection between two streams and has a lower time complexity than tree-based methods. We then modify the existing XML binary encoding method to tokenize the stream and the algorithm in order to support more operations than D-Path algorithm does. The experimental results reveal greater efficiency for the proposed method relative to tree-based methods. The execution time is at least 4 times faster than state-of-the-art tree-based methods. In addition, the scalability is much more efficient.
机译:检测XML文档之间的差异是XML最重要的研究主题之一。由于XML文档通常被认为是以树状结构组织的,因此大多数先前的研究都试图使用树匹配算法来检测差异。但是,由于执行时间,最优性和可伸缩性方面的限制,大多数树匹配算法的性能都不足。这项研究提出了一种基于流的差异检测方法,其中使用XML二进制编码算法来提供相对于以前的树匹配算法而言更高的性能。为了检测差异,对XML进行树结构分析不是必需的。我们使用一种O-Path算法,该算法在两个流之间进行差异检测时具有最佳的结果质量,并且比基于树的方法具有更低的时间复杂度。然后,我们修改现有的XML二进制编码方法以标记流和该算法,以支持比D-Path算法更多的操作。实验结果表明,相对于基于树的方法,该方法具有更高的效率。执行时间至少比最新的基于树的方法快4倍。此外,可伸缩性更加有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号