Hellenic Conference on AI

Clustering XML Documents by Structure

Abstract

This work explores the application of clustering methods to group structurally similar XML documents. Modeling the XML documents as rooted ordered labeled trees, we apply clustering algorithms using distances that estimate the similarity between those trees in terms of the hierarchical relationships of their nodes. We propose using tree structural summaries to improve the performance of the distance calculation while maintaining or even improving its quality. Experimental results are provided using a prototype testbed.
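The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not specify the distance or the summary construction, so everything below is an assumption made for illustration. The names `to_tree`, `summarize`, `paths`, `distance`, and `cluster` are hypothetical, the summary simply drops repeated sibling subtrees, the distance is a Jaccard distance over root-to-node label paths (a stand-in for a tree distance), and the grouping is a simple single-link pass with a threshold.

```python
import xml.etree.ElementTree as ET

def to_tree(elem):
    """Model an XML element as a rooted ordered labeled tree: (label, [children])."""
    return (elem.tag, [to_tree(child) for child in elem])

def summarize(tree):
    """Hypothetical structural summary: drop repeated sibling subtrees.

    Children are summarized first; a sibling whose summarized form already
    appeared under the same parent is skipped (first-occurrence order kept).
    """
    label, children = tree
    seen, kept = set(), []
    for child in children:
        summary = summarize(child)
        key = repr(summary)
        if key not in seen:
            seen.add(key)
            kept.append(summary)
    return (label, kept)

def paths(tree, prefix=()):
    """Collect the set of root-to-node label paths (ancestor relationships)."""
    label, children = tree
    here = prefix + (label,)
    result = {here}
    for child in children:
        result |= paths(child, here)
    return result

def distance(t1, t2):
    """Stand-in structural distance (assumption): Jaccard distance over label paths."""
    p1, p2 = paths(t1), paths(t2)
    return 1.0 - len(p1 & p2) / len(p1 | p2)

def cluster(trees, threshold):
    """Single-link grouping: a tree joins every cluster holding a member closer than threshold."""
    clusters = []
    for i, tree in enumerate(trees):
        near = [c for c in clusters
                if any(distance(tree, trees[j]) < threshold for j in c)]
        far = [c for c in clusters if c not in near]
        merged = sorted([j for c in near for j in c] + [i])
        clusters = far + [merged]
    return clusters

docs = [
    "<lib><book><title/><author/></book><book><title/><author/></book></lib>",
    "<lib><book><title/><author/></book></lib>",
    "<shop><item><price/></item></shop>",
]
# Summarize each tree before any distance is computed.
trees = [summarize(to_tree(ET.fromstring(d))) for d in docs]
print(cluster(trees, 0.5))  # [[0, 1], [2]]
```

In this sketch the summary only shrinks the trees that the distance has to traverse. With the path-set distance used here the result is unchanged by summarization, which loosely mirrors the stated goal of lowering the cost of the distance calculation without degrading its quality.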
