...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >An incremental nested partition method for data clustering
【24h】

An incremental nested partition method for data clustering

机译:用于数据集群的增量式嵌套分区方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Clustering methods are a powerful tool for discovering patterns in a given data set through an organization of data into subsets of objects that share common features. Motivated by the independent use of some different partitions criteria and the theoretical and empirical analysis of some of its properties, in this paper, we introduce an incremental nested partition method which combines these partitions criteria for finding the inner structure of static and dynamic datasets. For this, we proved that there are relationships of nesting between partitions obtained, respectively, from these partition criteria, and besides that the sensitivity when a new object arrives to the dataset is rigorously studied. Our algorithm exploits all of these mathematical properties for obtaining the hierarchy of clusterings. Moreover, we realize a theoretical and experimental comparative study of our method with classical hierarchical clustering methods such as single-link and complete-link and other more recently introduced methods. The experimental results over databases of UCI repository and the AFP and TDT2 news collections show the usefulness and capability of our method to reveal different levels of information hidden in datasets.
机译:聚类方法是一种强大的工具,可通过将数据组织成共享共同特征的对象子集来发现给定数据集中的模式。出于独立使用一些不同分区准则以及对其某些属性进行理论和经验分析的动机,本文介绍了一种增量嵌套分区方法,该方法结合了这些分区准则以查找静态和动态数据集的内部结构。为此,我们证明了从这些分区标准分别获得的分区之间存在嵌套关系,此外,还严格研究了新对象到达数据集时的敏感性。我们的算法利用所有这些数学属性来获得聚类的层次结构。此外,我们通过经典的层次聚类方法(例如单链接和完全链接)以及其他最近引入的方法,实现了我们方法的理论和实验比较研究。在UCI资料库的数据库以及AFP和TDT2新闻集上进行的实验结果表明,我们的方法能够揭示隐藏在数据集中的不同级别信息的有用性和能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号