【24h】

An Efficient Algorithm for Clustering XML Schemas

机译:高效的XML模式聚类算法

获取原文
获取原文并翻译 | 示例

摘要

Schema clustering is important as a prerequisite to the integration of XML schemas. This paper presents an efficient method for clustering XML schemas. The proposed method first computes similarities among schemas. The similarity is defined by the size of the common structure between two schemas under the assumption that the schemas with less cost to be integrated are more similar. Specifically, we extract one-to-one matchings between paths with the largest number of corresponding elements. Finally, a hierarchical clustering method is applied to the value of similarity. Experimental results with many XML schemas show that the method has performed better compared with previous works, resulting in a precision of 98% and a rate of clustering of 95% in average.
机译:架构集群对于XML架构的集成非常重要。本文提出了一种有效的XML模式集群方法。所提出的方法首先计算架构之间的相似度。相似度是由两个模式之间的通用结构的大小定义的,前提是要整合成本较低的模式更为相似。具体来说,我们提取具有最大数量对应元素的路径之间的一对一匹配。最后,将层次聚类方法应用于相似度值。在许多XML模式下的实验结果表明,与以前的工作相比,该方法具有更好的性能,其精度为98%,平均聚类率为95%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号