首页> 外文会议>Advances in databases and information systems >Clustering XML Documents by Structure
【24h】

Clustering XML Documents by Structure

机译:按结构对XML文档进行聚类

获取原文
获取原文并翻译 | 示例

摘要

Clustering of XML documents is an important data mining method, the aim of which is the grouping of similar XML documents. The issue of clustering XML documents by structure is being considered in this paper. Two different and independent methods of clustering XML documents by structure are being proposed. The first method represents a set of XML documents as a set of labels. The second method introduces a new representation of a set of XML documents, which is called the SuperTree. In this paper, it is suggested that the proposed methods may improve the accuracy of XML clustering by structure. Such thesis is based on the tests, the aim of which is to assess advantages of the proposals, as conducted respectively on the heterogeneous and homogenous sets of data.
机译:XML文档的聚类是一种重要的数据挖掘方法,其目的是对相似的XML文档进行分组。本文考虑了按结构对XML文档进行集群的问题。提出了两种不同且独立的按结构对XML文档进行聚类的方法。第一种方法将一组XML文档表示为一组标签。第二种方法引入了一组XML文档的新表示形式,称为SuperTree。在本文中,建议所提出的方法可以通过结构提高XML聚类的准确性。这样的论文是基于测试的,其目的是评估提案的优势,分别针对异类和同类数据集进行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号