首页> 外文会议>International Conference on Web Information Systems Engineering >A Novel Method for Finding Similarities between Unordered Trees Using Matrix Data Model
【24h】

A Novel Method for Finding Similarities between Unordered Trees Using Matrix Data Model

机译:使用矩阵数据模型找到无序树之间的相似性的新方法

获取原文

摘要

Trees are capable of portraying the semi-structured data which is common in web domain. Finding similarities between trees is mandatory for several applications that deal with semi-structured data. Existing similarity methods examine a pair of trees by comparing through nodes and paths of two trees, and find the similarity between them. However, these methods provide unfavorable results for unordered tree data and result in yielding NP-hard or MAX-SNP hard complexity. In this paper, we present a novel method that encodes a tree with an optimal traversing approach first, and then, utilizes it to model the tree with its equivalent matrix representation for finding similarity between unordered trees efficiently. Empirical analysis shows that the proposed method is able to achieve high accuracy even on the large data sets.
机译:树木能够描绘在Web域中常见的半结构化数据。对于处理半结构化数据的若干应用程序,树木之间的相似性是强制性的。通过通过两棵树的节点和路径进行比较,找到现有的相似性方法,并找到它们之间的相似性。但是,这些方法为无序树数据提供了不利的结果,并导致产生NP-HARD或MAX-SNP的硬复杂性。在本文中,我们提出了一种新的方法,它首先利用最佳遍历方法对树进行编码,然后利用它用其等效矩阵表示来模拟树,以便有效地找到无序树之间的相似性。实证分析表明,即使在大数据集上也能够实现高精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号