首页> 外文会议>Intelligent Computing: Theory and Applications III >Highly Efficient Incremental Estimation of Gaussian Mixture Models for Online Data Stream Clustering
【24h】

Highly Efficient Incremental Estimation of Gaussian Mixture Models for Online Data Stream Clustering

机译:用于在线数据流聚类的高斯混合模型的高效增量估计

获取原文
获取原文并翻译 | 示例

摘要

We present a probability-density-based data stream clustering approach which requires only the newly arrived data, not the entire historical data, to be saved in memory. This approach incrementally updates the density estimate taking only the newly arrived data and the previously estimated density. The idea roots on a theorem of estimator updating and it works naturally with Gaussian mixture models. We implement it through the expectation maximization algorithm and a cluster merging strategy by multivariate statistical tests for equality of covariance and mean. Our approach is highly efficient in clustering voluminous online data streams when compared to the standard EM algorithm. We demonstrate the performance of our algorithm on clustering a simulated Gaussian mixture data stream and clustering real noisy spike signals extracted from neuronal recordings.
机译:我们提出了一种基于概率密度的数据流聚类方法,该方法仅需要将新到达的数据而不是整个历史数据保存在内存中。该方法仅采用新到达的数据和先前估计的密度来增量更新密度估计。这个想法基于估计量更新定理,并且可以自然地与高斯混合模型一起使用。我们通过期望最大化算法和聚类合并策略,通过多元统计检验来实现协方差和均值的均等性。与标准EM算法相比,我们的方法在集群大量在线数据流方面非常高效。我们展示了我们的算法在对模拟的高斯混合数据流进行聚类以及对从神经元录音中提取的真实噪声尖峰信号进行聚类的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号