首页> 外文期刊>Advances in data analysis and classification >Model-based clustering of high-dimensional data streams with online mixture of probabilistic PCA
【24h】

Model-based clustering of high-dimensional data streams with online mixture of probabilistic PCA

机译:基于模型的高维数据流与概率PCA在线混合的聚类

获取原文
获取原文并翻译 | 示例
       

摘要

Model-based clustering is a popular tool which is renowned for its probabilistic foundations and its flexibility. However, model-based clustering techniques usually perform poorly when dealing with high-dimensional data streams, which are nowadays a frequent data type. To overcome this limitation of model-based clustering, we propose an online inference algorithm for the mixture of probabilistic PCA model. The proposed algorithm relies on an EM-based procedure and on a probabilistic and incremental version of PCA. Model selection is also considered in the online setting through parallel computing. Numerical experiments on simulated and real data demonstrate the effectiveness of our approach and compare it to state-of-the-art online EM-based algorithms.
机译:基于模型的集群是一种流行的工具,以其概率基础和灵活性而闻名。但是,基于模型的聚类技术在处理高维数据流时通常表现不佳,而高维数据流是当今常见的数据类型。为了克服基于模型的聚类的这一局限性,我们针对概率PCA模型的混合提出了一种在线推理算法。所提出的算法依赖于基于EM的过程以及PCA的概率和增量版本。在线设置中还通过并行计算来考虑模型选择。在模拟和真实数据上进行的数值实验证明了我们方法的有效性,并将其与最新的在线基于EM的算法进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号