...
首页> 外文期刊>Journal of applied statistics >Robust mixture model cluster analysis using adaptive kernels
【24h】

Robust mixture model cluster analysis using adaptive kernels

机译:使用自适应内核的鲁棒混合模型聚类分析

获取原文
获取原文并翻译 | 示例
           

摘要

The traditional mixture model assumes that a dataset is composed of several populations of Gaussian distributions. In real life, however, data often do not fit the restrictions of normality very well. It is likely that data from a single population exhibiting either asymmetrical or heavy-tail behavior could be erroneously modeled as two populations, resulting in suboptimal decisions. To avoid these pitfalls, we generalize the mixture model using adaptive kernel density estimators. Because kernel density estimators enforce no functional form, we can adapt to non-normal asymmetric, kurtotic, and tail characteristics in each population independently. This, in effect, robustifies mixture modeling. We adapt two computational algorithms, genetic algorithm with regularized Mahalanobis distance and genetic expectation maximization algorithm, to optimize the kernel mixture model (KMM) and use results from robust estimation theory in order to data-adaptively regularize both. Finally, we likewise extend the information criterion ICOMP to score the KMM. We use these tools to simultaneously select the best mixture model and classify all observations without making any subjective decisions. The performance of the KMM is demonstrated on two medical datasets; in both cases, we recover the clinically determined group structure and substantially improve patient classification rates over the Gaussian mixture model.
机译:传统的混合模型假设数据集由几个高斯分布总体组成。但是,在现实生活中,数据通常无法很好地满足正常性的限制。来自单个种群表现出不对称或重尾行为的数据可能被错误地建模为两个种群,从而导致次优决策。为避免这些陷阱,我们使用自适应核密度估计器对混合模型进行了概括。由于核密度估计器不强制执行任何函数形式,因此我们可以独立地适应每个总体中的非正态不对称,峰度和尾部特征。实际上,这使混合物建模更加可靠。我们采用两种计算算法(具有正则化的Mahalanobis距离的遗传算法和遗传期望最大化算法)来优化内核混合模型(KMM),并使用鲁棒估计理论的结果来对两者进行数据自适应正则化。最后,我们同样将信息标准ICOMP扩展为KMM。我们使用这些工具同时选择最佳的混合模型并分类所有观察结果,而无需做出任何主观决定。 KMM的性能在两个医疗数据集上得到了证明;在这两种情况下,我们都恢复了临床确定的组结构,并大大提高了高斯混合模型的患者分类率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号