A New Sequential Mining Approach to XML Document Similarity Computation

机译：XML文档相似度计算的一种新的顺序挖掘方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

There exist several methods to measuring the structural similarity among XML documents. The data mining approach seems to be a novel, interesting and promising one. In view of the deficiencies encountered by ignoring the hierarchical information in encoding the paths for mining, we propose a new sequential pattern mining scheme for XML document similarity computation. It makes use of the hierarchical information to computing the document structural similarity. In addition, it includes a post-processing step to reuse the mined patterns to estimate the similarity of unmatched elements so that another metric to qualify the similarity between XML documents can be introduced. Encouraging experimental results were obtained and reported.

机译：有几种方法可以测量XML文档之间的结构相似性。数据挖掘方法似乎是一种新颖，有趣且有前途的方法。鉴于在编码挖掘路径时忽略层次信息会遇到缺陷，我们提出了一种用于XML文档相似度计算的新的顺序模式挖掘方案。它利用层次信息来计算文档的结构相似度。另外，它还包括一个后处理步骤，以重用挖掘的模式来估计不匹配元素的相似度，从而可以引入另一个度量标准来限定XML文档之间的相似度。获得了令人鼓舞的实验结果并进行了报道。

著录项

来源
《Advances in Knowledge Discovery and Data Mining》|2003年|p.356-362|共7页
会议地点
作者
Ho-pong Leung; Fu-lai Chung; Stephen Chi-fai Chan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. On the use of hierarchical information in sequential mining-based XML document similarity computation [J] . Leung HP, Chung FL, Chan SCF Knowledge and information systems . 2005,第4期

机译：关于层次信息在基于顺序挖掘的XML文档相似度计算中的使用
2. On the use of hierarchical information in sequential mining-based XML document similarity computation [J] . Ho-pong Leung, Fu-lai Chung, Stephen Chi-fai Chan Knowledge and Information Systems . 2005,第4期

机译：关于层次信息在基于顺序挖掘的XML文档相似度计算中的使用
3. An efficient similarity-based approach for comparing XML documents [J] . Oliveira Alessandreia, Tessarolli Gabriel, Ghiotto Gleiph, Information Systems . 2018,第NOVa期

机译：一种有效的基于相似度的XML文档比较方法
4. A New Sequential Mining Approach to XML Document Similarity Computation [C] . Ho-pong Leung, Fu-lai Chung, Stephen Chi-fai Chan Pacific-Asia Conference on Knowledge Discovery and Data Mining . 2003

机译：一种新的XML文档相似性计算方法的顺序挖掘方法
5. A bottom-up approach for XML document classification. [D] . Wu, Junwei. 2009

机译：XML文档分类的自底向上方法。
6. Management of Clinical XML Documents: A Pragmatic Approach [O] . R Schweiger, T Buerkle, S Hoelzer, 2000

机译：临床XML文档管理：一种务实的方法
7. A new sequential mining approach to XML document similarity computation [O] . Leung HP, Chung FL, Chan SCF 2003

机译：XML文档相似度计算的一种新的顺序挖掘方法

A New Sequential Mining Approach to XML Document Similarity Computation

摘要

著录项

相似文献

相关主题

期刊订阅