Source: 2012 6th IEEE International Conference on Intelligent Systems, vol. 1-2

Structure-oriented clustering of XML documents: A transactional approach


Abstract

Clustering XML documents by structure has generally been accomplished by looking at the occurrence of a single, pre-established type of structural component in the structures of the XML documents. Focusing on only one type of structural component is likely to produce clusters with some degree of inner structural inhomogeneity, either because differences in the structures of the XML documents go uncaught or because the structural component is poorly chosen. To overcome these limitations, a new parameter-free approach to clustering XML documents is proposed that considers multiple types of structural components simultaneously in order to isolate structurally homogeneous clusters of XML documents. The idea behind the approach is to represent each XML document as a transaction of boolean features that capture a suitable selection of its structural components. A parameter-free clustering scheme is then used to isolate structurally homogeneous clusters. A comparative evaluation over both real and synthetic XML data provides evidence of the effectiveness and efficacy of the devised approach.
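The transactional representation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which structural components the authors select, so the two feature types used here (root-to-node tag paths and parent-child tag edges) are assumptions chosen only for illustration, and the function names `xml_to_transaction` and `to_boolean_rows` are hypothetical. The parameter-free clustering scheme itself is not reproduced.

```python
import xml.etree.ElementTree as ET


def xml_to_transaction(xml_text):
    """Encode one XML document as a set of structural features (a transaction)."""
    root = ET.fromstring(xml_text)
    features = set()

    def visit(node, path):
        current = path + (node.tag,)
        # Assumed feature type 1: the root-to-node tag path.
        features.add(("path", "/".join(current)))
        for child in node:
            # Assumed feature type 2: the parent-child tag edge.
            features.add(("edge", node.tag, child.tag))
            visit(child, current)

    visit(root, ())
    return frozenset(features)


def to_boolean_rows(transactions):
    """Turn transactions into boolean rows over a shared feature vocabulary."""
    vocabulary = sorted(set().union(*transactions))
    rows = [[feature in t for feature in vocabulary] for t in transactions]
    return vocabulary, rows


docs = ["<a><b/><c/></a>", "<a><b/><b/></a>"]
transactions = [xml_to_transaction(d) for d in docs]
vocabulary, rows = to_boolean_rows(transactions)
```

Each row marks which of the features in the shared vocabulary occur in a document, so several types of structural component are considered at once. Any transactional clustering method could then operate on these rows.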
