首页> 外文会议>International Conference on Web Information Systems Engineering >A Novel Method for Finding Similarities between Unordered Trees Using Matrix Data Model
【24h】

A Novel Method for Finding Similarities between Unordered Trees Using Matrix Data Model

机译:使用矩阵数据模型找到无序树之间的相似性的新方法

获取原文

摘要

Trees are capable of portraying the semi-structured data which is common in web domain. Finding similarities between trees is mandatory for several applications that deal with semi-structured data. Existing similarity methods examine a pair of trees by comparing through nodes and paths of two trees, and find the similarity between them. However, these methods provide unfavorable results for unordered tree data and result in yielding NP-hard or MAX-SNP hard complexity. In this paper, we present a novel method that encodes a tree with an optimal traversing approach first, and then, utilizes it to model the tree with its equivalent matrix representation for finding similarity between unordered trees efficiently. Empirical analysis shows that the proposed method is able to achieve high accuracy even on the large data sets.
机译:树木能够描绘在Web域中常见的半结构化数据。 对于处理半结构化数据的若干应用程序,在树木之间找到相似性。 通过通过两个树的节点和路径进行比较,找到一对树,并找到它们之间的相似性。 但是,这些方法为无序树数据提供了不利的结果,并导致产生NP硬或MAX-SNP的硬复杂性。 在本文中,我们介绍了一种新的方法,它首先使用最佳遍历方法对树进行编码,然后利用它用其等效矩阵表示来模拟树,以便有效地在无序树之间找到相似性。 实证分析表明,即使在大数据集上也能够实现高精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号