Highly Efficient Incremental Estimation of Gaussian Mixture Models for Online Data Stream Clustering


Abstract

We present a probability-density-based data stream clustering approach that requires only the newly arrived data, not the entire historical data, to be kept in memory. The approach updates the density estimate incrementally, using only the newly arrived data and the previously estimated density. The idea is rooted in a theorem on estimator updating, and it works naturally with Gaussian mixture models. We implement it with the expectation maximization (EM) algorithm and a cluster merging strategy based on multivariate statistical tests for equality of covariance and mean. Compared with the standard EM algorithm, our approach is highly efficient at clustering voluminous online data streams. We demonstrate its performance by clustering a simulated Gaussian mixture data stream and real, noisy spike signals extracted from neuronal recordings.
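
The abstract does not give the update formulas, so the following is only a minimal sketch of one common way to combine a previously estimated Gaussian mixture with a mixture fitted to a new chunk. It assumes components are reweighted by sample counts and that two components judged equivalent are merged by matching their first two moments. The function names `combine_mixtures` and `merge_components` are hypothetical. The EM fit of the new chunk and the statistical tests for equality of mean and covariance, which would decide whether a merge happens, are not shown.

```python
import numpy as np

def combine_mixtures(old, n_old, new, n_new):
    """Pool two mixtures, scaling weights by the sample counts behind them.

    Each mixture is a list of (weight, mean, covariance) tuples whose
    weights sum to 1. This count-proportional rule is an assumption,
    not a formula taken from the paper.
    """
    total = n_old + n_new
    pooled = [(w * n_old / total, mu, cov) for w, mu, cov in old]
    pooled += [(w * n_new / total, mu, cov) for w, mu, cov in new]
    return pooled

def merge_components(w1, mu1, cov1, w2, mu2, cov2):
    """Replace two weighted Gaussians by one with the same overall
    weight, mean, and covariance (moment matching)."""
    w = w1 + w2
    a1, a2 = w1 / w, w2 / w
    mu = a1 * mu1 + a2 * mu2
    d1 = (mu1 - mu)[:, None]  # column vectors for the outer products
    d2 = (mu2 - mu)[:, None]
    cov = a1 * (cov1 + d1 @ d1.T) + a2 * (cov2 + d2 @ d2.T)
    return w, mu, cov

# Hypothetical usage: one old component (300 points), one new (100 points).
old = [(1.0, np.array([0.0, 0.0]), np.eye(2))]
new = [(1.0, np.array([0.1, -0.1]), np.eye(2))]
pooled = combine_mixtures(old, 300, new, 100)
# Assume an equality test (not shown) accepted these two components.
merged = merge_components(*pooled[0], *pooled[1])
```

Only the mixture parameters and the sample counts are carried forward in this sketch, which is what lets the historical data be discarded once a chunk has been processed.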


Bibliographic record

Source
Intelligent Computing: Theory and Applications III | 2005 | pp. 174-183 | 10 pages
Conference location: Orlando, FL (US)
Authors
Mingzhou Song; Hongbin Wang
Author affiliation

Department of Computer Science, Queens College of CUNY, Flushing, NY 11367, USA

Original format: PDF
Language: English
Chinese Library Classification: automation system theory
Keywords
data stream clustering; Gaussian mixture models; expectation maximization; density merging

