...
首页> 外文期刊>Statistics and computing >Robust clustering tools based on optimal transportation
【24h】

Robust clustering tools based on optimal transportation

机译:基于最佳运输的强大聚类工具

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

A robust clustering method for probabilities in Wasserstein space is introduced. This new trimmed k-barycenters' approach relies on recent results on barycenters in Wasserstein space that allow intensive computation, as required by clustering algorithms to be feasible. The possibility of trimming the most discrepant distributions results in a gain in stability and robustness, highly convenient in this setting. As a remarkable application, we consider a parallelized clustering setup in which each of m units processes a portion of the data, producing a clustering report, encoded as k probabilities. We prove that the trimmed k-barycenter of the mxk reports produces a consistent aggregation which we consider the result of a wide consensus'. We also prove that a weighted version of trimmed k-means algorithms based on k-barycenters in the space of Wasserstein keeps the descending character of the concentration step, guaranteeing convergence to local minima. We illustrate the methodology with simulated and real data examples. These include clustering populations by age distributions and analysis of cytometric data.
机译:介绍了一种针对Wasserstein空间中概率的鲁棒聚类方法。这种新的修剪k重心的方法依赖于Wasserstein空间中重心的最新结果,可以根据聚类算法的要求进行密集的计算。修剪最不均匀的分布的可能性会导致稳定性和鲁棒性的提高,在这种情况下非常方便。作为一个杰出的应用程序,我们考虑并行化的聚类设置,其中m个单元中的每一个处理一部分数据,生成聚类报告,编码为k个概率。我们证明mxk报告的修剪后的k重心产生了一致的聚合,我们认为这是广泛共识的结果。我们还证明了Wasserstein空间中基于k重心的修剪k均值算法的加权版本保持了集中步长的下降特性,从而保证了收敛到局部极小值。我们通过模拟和真实数据示例来说明该方法。其中包括按年龄分布对人群进行聚类以及对细胞数据进行分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号