首页> 外文期刊>Journal of applied statistics >Cluster detection and clustering with random start forward searches
【24h】

Cluster detection and clustering with random start forward searches

机译:通过随机开始向前搜索进行聚类检测和聚类

获取原文
获取原文并翻译 | 示例
       

摘要

The forward search is a method of robust data analysis in which outlier free subsets of the data of increasing size are used in model fitting; the data are then ordered by closeness to the model. Here the forward search, with many random starts, is used to cluster multivariate data. These random starts lead to the diagnostic identification of tentative clusters. Application of the forward search to the proposed individual clusters leads to the establishment of cluster membership through the identification of non-cluster members as outlying. The method requires no prior information on the number of clusters and does not seek to classify all observations. These properties are illustrated by the analysis of 200 six-dimensional observations on Swiss banknotes. The importance of linked plots and brushing in elucidating data structures is illustrated. We also provide an automatic method for determining cluster centres and compare the behaviour of our method with model-based clustering. In a simulated example with eight clusters our method provides more stable and accurate solutions than model-based clustering. We consider the computational requirements of both procedures.
机译:前向搜索是一种鲁棒的数据分析方法,其中,在模型拟合中使用大小递增的离群值自由数据子集。然后根据与模型的接近程度对数据进行排序。在这里,具有许多随机起点的正向搜索用于对多元数据进行聚类。这些随机开始导致对诊断性簇的诊断鉴定。通过将非集群成员标识为外围,将向前搜索应用于建议的单个集群将导致建立集群成员。该方法不需要有关聚类数量的先验信息,也不寻求对所有观察进行分类。通过对200张瑞士纸币的六维观测结果进行分析,可以说明这些特性。说明了链接图和笔刷在阐明数据结构中的重要性。我们还提供了一种自动方法来确定聚类中心,并将我们的方法的行为与基于模型的聚类进行比较。在具有八个群集的模拟示例中,我们的方法比基于模型的群集提供了更稳定和准确的解决方案。我们考虑两个过程的计算要求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号