Cluster detection and clustering with random start forward searches

Atkinson Anthony C.; Riani Marco; Cerioli Andrea

首页> 外文期刊>Journal of applied statistics >Cluster detection and clustering with random start forward searches

【24h】

Cluster detection and clustering with random start forward searches

机译：通过随机开始向前搜索进行聚类检测和聚类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The forward search is a method of robust data analysis in which outlier free subsets of the data of increasing size are used in model fitting; the data are then ordered by closeness to the model. Here the forward search, with many random starts, is used to cluster multivariate data. These random starts lead to the diagnostic identification of tentative clusters. Application of the forward search to the proposed individual clusters leads to the establishment of cluster membership through the identification of non-cluster members as outlying. The method requires no prior information on the number of clusters and does not seek to classify all observations. These properties are illustrated by the analysis of 200 six-dimensional observations on Swiss banknotes. The importance of linked plots and brushing in elucidating data structures is illustrated. We also provide an automatic method for determining cluster centres and compare the behaviour of our method with model-based clustering. In a simulated example with eight clusters our method provides more stable and accurate solutions than model-based clustering. We consider the computational requirements of both procedures.

机译：前向搜索是一种鲁棒的数据分析方法，其中，在模型拟合中使用大小递增的离群值自由数据子集。然后根据与模型的接近程度对数据进行排序。在这里，具有许多随机起点的正向搜索用于对多元数据进行聚类。这些随机开始导致对诊断性簇的诊断鉴定。通过将非集群成员标识为外围，将向前搜索应用于建议的单个集群将导致建立集群成员。该方法不需要有关聚类数量的先验信息，也不寻求对所有观察进行分类。通过对200张瑞士纸币的六维观测结果进行分析，可以说明这些特性。说明了链接图和笔刷在阐明数据结构中的重要性。我们还提供了一种自动方法来确定聚类中心，并将我们的方法的行为与基于模型的聚类进行比较。在具有八个群集的模拟示例中，我们的方法比基于模型的群集提供了更稳定和准确的解决方案。我们考虑两个过程的计算要求。

著录项

来源
《Journal of applied statistics》 |2018年第8期|777-798|共22页
作者
Atkinson Anthony C.; Riani Marco; Cerioli Andrea;
展开▼
作者单位

London Sch Econ, Dept Stat, London WC2A 2AE, England;

Univ Parma, Dipartimento Sci Econ & Aziendale, Parma, Italy;

Univ Parma, Dipartimento Sci Econ & Aziendale, Parma, Italy;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Brushing; data structure; forward search; graphical methods; linked plots; Mahalanobis distance; MM estimation; outliers; S estimation; Tukey's biweight; 62H30; 62P20; 62F35;

机译：涂刷;数据结构;前向搜索;图形方法;链接图;马哈拉诺比斯距离;MM估计;离群值;S估计;Tukey的权重;62H30;62P20;62F35;

相似文献

外文文献
中文文献
专利

1. Pay-it-forward gonorrhea and chlamydia testing among men who have sex with men in China: a study protocol for a three-arm cluster randomized controlled trial [J] . Tiange P.Zhang, Guodong Mi, Yehua Wang, 贫困所致传染病（英文） . 2019,第004期

机译：在中国与男性发生性关系的男性中进行付费性淋病和衣原体检测：一项三臂类群随机对照试验的研究方案
2. Secure Connectivity Probability of Multi‐hop Clustered Randomize‐and‐Forward Networks [J] . Xiaowei Wang, Zhou Su, Guangyi Wang ETRI journal . 2017,第5期

机译：多跳群集随机转发网络的安全连接概率
3. Impact of a store-and-forward teledermatology intervention versus usual care on delay before beginning treatment: A pragmatic cluster-randomized trial in ambulatory care [J] . Piette Edouard, Nougairede Michel, Vuong Valerie, Journal of telemedicine and telecare . 2017,第8期

机译：在开始治疗前，店内杂草学的干预与通常护理延迟的影响：车辆护理中的务实簇随机试验
4. Minimum Cluster Size Estimation and Cluster Refinement for the Randomized Gravitational Clustering Algorithm [C] . Jonatan Gomez, Elizabeth Leon, Olfa Nasraoui Ibero-American conference on artificial intelligence . 2012

机译：随机引力聚类算法的最小聚类大小估计和聚类细化
5. Challenges in the Ethical Conduct and Ethics Review of Cluster Randomized Trials: A Survey of Cluster Randomization Trialists. [D] . Chaudhry, Shazia Hira. 2012

机译：聚类随机试验的道德行为和伦理审查中的挑战：聚类随机试验者的调查。
6. Pay-it-forward gonorrhea and chlamydia testing among men who have sex with men in China: a study protocol for a three-arm cluster randomized controlled trial [O] . Tiange P. Zhang, Fan Yang, Weiming Tang, 2019

机译：在中国与男性发生性关系的男性中进行付费性淋病和衣原体检测：一项三臂类群随机对照试验的研究方案
7. Cluster detection and clustering with random start forward searches [O] . Atkinson, Anthony C., Riani, Marco, Cerioli, Andrea 2017

机译：通过随机开始向前搜索进行聚类检测和聚类
8. Feasibility of Utilizing Cluster Sampling in the Navy: A Demographic Comparison of Cluster and Stratified Random Samples [R] . Newell, C. E. , Dever, J. A. 2004

机译：海军利用集群抽样的可行性：聚类与分层随机样本的人口学比较

Cluster detection and clustering with random start forward searches

摘要

著录项

相似文献

相关主题

期刊订阅