首页> 外文会议>IEEE International Conference on Intelligent Computing and Intelligent Systems >Application of Gaussian Mixture Model Genetic Algorithm in data stream clustering analysis
【24h】

Application of Gaussian Mixture Model Genetic Algorithm in data stream clustering analysis

机译:高斯混合模型遗传算法在数据流聚类分析中的应用

获取原文

摘要

Data stream is infinite data and quick stream speed, so traditional clustering algorithm can not be applied to data stream clustering directly. As an efficient tool for data analysis, Gaussian mixture model has been widely applied in the fields of signal and information processing. We can use Gaussian mixture model (GMM) simulate arbitrary clustering graphics. There are two critical problems for the clustering analysis technology to select the appropriate value of number of clusters and partition overlapping clusters. Base on an extending method of Gaussian mixture modeling, a new feature mining method named Gaussian Mixture Model with Genetic Algorithms is proposed in this paper. This method is use a probability density based data stream clustering which requires only the newly arrived data, not the entire historical data, and also can choose optimal estimation clusters number value. The algorithm can determine the number of Gaussian clusters and the parameters of each Gaussian through random split and merge operation of Genetic Algorithms. We can get the accurate information each attribute characteristic describe. So that can make an effective date stream mining.
机译:数据流是无限的数据,流速度快,因此传统的聚类算法不能直接应用于数据流聚类。作为一种有效的数据分析工具,高斯混合模型已广泛应用于信号和信息处理领域。我们可以使用高斯混合模型(GMM)模拟任意聚类图形。对于聚类分析技术来说,选择聚类数量和分区重叠聚类的适当值存在两个关键问题。在高斯混合建模的扩展方法的基础上,提出了一种新的基于遗传算法的高斯混合模型特征挖掘方法。该方法使用基于概率密度的数据流聚类,该聚类仅需要新到达的数据,而不需要整个历史数据,并且还可以选择最佳估计聚类数值。该算法可以通过遗传算法的随机分裂和合并运算来确定高斯聚类的数量和每个高斯的参数。我们可以获得每个属性特征描述的准确信息。这样就可以进行有效的日期流挖掘。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号