Breakdown points for maximum likelihood estimators of location-scale mixtures

Hennig C

首页> 外文期刊>The Annals of Statistics: An Official Journal of the Institute of Mathematical Statistics >Breakdown points for maximum likelihood estimators of location-scale mixtures

【24h】

Breakdown points for maximum likelihood estimators of location-scale mixtures

机译：位置尺度混合的最大似然估计的分解点

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

ML-estimation based on mixtures of Normal distributions is a widely used tool for cluster analysis. However, a single outlier can make the parameter estimation of at least one of the mixture components break down. Among others, the estimation of mixtures of t-distributions by McLachlan and Peel [Finite Mixture Models (2000) Wiley, New York] and the addition of a further mixture component accounting for "noise" by Fraley and Raftery [The Computer J. 41 (1998) 578-588] were suggested as more robust alternatives. In this paper, the definition of an adequate robustness measure for cluster analysis is discussed and bounds for the breakdown points of the mentioned methods are given. It turns out that the two alternatives, while adding stability in the presence of outliers of moderate size, do not possess a substantially better breakdown behavior than estimation based on Normal mixtures. If the number of clusters s is treated as fixed, r additional points suffice for all three methods to let the parameters of r clusters explode. Only in the case of r = s is this not possible for t-mixtures. The ability to estimate the number of mixture components, for example, by use of the Bayesian information criterion of Schwarz [Ann. Statist. 6 (1978) 461-464], and to isolate gross outliers as clusters of one point, is crucial for all improved breakdown behavior of all three techniques. Furthermore, a mixture of Normals with an improper uniform distribution is proposed to achieve more robustness in the case of a fixed number of components.

机译：基于正态分布混合的ML估计是一种广泛用于聚类分析的工具。然而，单个离群值可以使至少一种混合成分的参数估计崩溃。其中，麦克拉克兰（McLachlan）和皮尔（Peel）估计t分布的混合[Finite Mixture Models（2000），纽约威利]，以及弗雷利和拉夫蒂（Fraley and Raftery）添加了另外一个考虑“噪声”的混合成分[计算机杂志41] （1998）578-588]被提出作为更可靠的替代方案。在本文中，讨论了用于聚类分析的适当鲁棒性度量的定义，并给出了上述方法的崩溃点的界限。事实证明，这两种选择在存在中等大小的异常值时增加了稳定性，但没有比基于普通混合物进行估计的击穿性能好得多。如果将簇数s视为固定，则对于这三种方法，r个附加点就足以使r簇的参数爆炸。仅在r = s的情况下，这对于t混合物是不可能的。估计混合成分数量的能力，例如，通过使用Schwarz的贝叶斯信息准则[Ann。统计员。 6（1978）461-464]，以及将总体异常值隔离为一个点的群集，对于所有这三种技术的所有改进的故障行为至关重要。此外，建议使用具有不适当均匀分布的法线的混合，以在组件数量固定的情况下实现更高的鲁棒性。

著录项

来源
《The Annals of Statistics: An Official Journal of the Institute of Mathematical Statistics》 |2004年第4期|共28页
作者
Hennig C;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类概率论与数理统计;
关键词
model-based cluster analysis; robust statistics; Normal mixtures; mixtures of t-distributions; noise component; classification breakdown point; EM-ALGORITHM; ROBUST ESTIMATION; MODEL; IDENTIFICATION; DISTRIBUTIONS; METHODOLOGY; CLUSTERS;

机译：基于模型的聚类分析;稳健统计;正态混合物;t分布的混合物;噪声成分;分类分解点;EM-算法;鲁棒估计;模型;标识;分布;方法论;聚类;
入库时间 2022-08-18 18:25:02

相似文献

外文文献
中文文献
专利

1. Breakdown points for maximum likelihood estimators of location-scale mixtures [J] . Hennig C The Annals of Statistics: An Official Journal of the Institute of Mathematical Statistics . 2004,第4期

机译：位置尺度混合的最大似然估计的分解点
2. Strong consistency of the maximum likelihood estimator for finite mixtures of location-scale distributions when the scale parameters are exponentially small [J] . Tanaka K, Takemura A Bernoulli: official journal of the Bernoulli Society for Mathematical Statistics and Probability . 2006,第6期

机译：当比例尺参数呈指数形式变小时，最大似然估计器对于位置比例尺分布的有限混合具有很强的一致性
3. The location-scale mixture exponential power distribution: A Bayesian and maximum likelihood approach [J] . Rahnamaei Z., Nematollahi N., Farnoosh R. Journal of applied mathematics . 2012,第Pta11期

机译：位置尺度混合指数功率分布：贝叶斯和最大似然法
4. Consistency and Asymptotic Normality of the Maximum Likelihood Estimator in a Zero-inflated Poisson Mixture Distributions [C] . YANG Aijun, YANG Zhenhai 2009 International Institute of Applied Statistics Studies(2009 国际应用统计学术研讨会）论文集 . 2009

机译：零膨胀泊松混合分布中最大似然估计的一致性和渐近正态性
5. A comparison of estimators in hierarchical linear modeling: Restricted maximum likelihood versus bootstrap via minimum norm quadratic unbiased estimators. [D] . Delpish, Ayesha Nneka. 2006

机译：分层线性建模中估计量的比较：通过最小范数二次无偏估计量来限制最大似然与自举。
6. An Example of an Improvable Rao–Blackwell Improvement Inefficient Maximum Likelihood Estimator and Unbiased Generalized Bayes Estimator [O] . Tal Galili, Isaac Meilijson -1

机译：改进的Rao-Blackwell改进无效最大似然估计和无偏广义贝叶斯估计的示例
7. Breakdown points for maximum likelihood estimators of location-scale mixtures [O] . Hennig, Christian 2004

机译：位置尺度最大似然估计的细分点混合物

Breakdown points for maximum likelihood estimators of location-scale mixtures

摘要

著录项

相似文献

相关主题

期刊订阅