首页> 外文会议>Advances in Knowledge Discovery and Data Mining >A New Sequential Mining Approach to XML Document Similarity Computation
【24h】

A New Sequential Mining Approach to XML Document Similarity Computation

机译:XML文档相似度计算的一种新的顺序挖掘方法

获取原文

摘要

There exist several methods to measuring the structural similarity among XML documents. The data mining approach seems to be a novel, interesting and promising one. In view of the deficiencies encountered by ignoring the hierarchical information in encoding the paths for mining, we propose a new sequential pattern mining scheme for XML document similarity computation. It makes use of the hierarchical information to computing the document structural similarity. In addition, it includes a post-processing step to reuse the mined patterns to estimate the similarity of unmatched elements so that another metric to qualify the similarity between XML documents can be introduced. Encouraging experimental results were obtained and reported.
机译:有几种方法可以测量XML文档之间的结构相似性。数据挖掘方法似乎是一种新颖,有趣且有前途的方法。鉴于在编码挖掘路径时忽略层次信息会遇到缺陷,我们提出了一种用于XML文档相似度计算的新的顺序模式挖掘方案。它利用层次信息来计算文档的结构相似度。另外,它还包括一个后处理步骤,以重用挖掘的模式来估计不匹配元素的相似度,从而可以引入另一个度量标准来限定XML文档之间的相似度。获得了令人鼓舞的实验结果并进行了报道。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号