首页> 中文期刊> 《现代电子技术》 >优先聚类和高斯混合模型树相融合的递增聚类研究

优先聚类和高斯混合模型树相融合的递增聚类研究

         

摘要

The traditional clustering algorithms consume a large amount of time and memory for large dataset clustering, can′t adapt to the dynamic performance of big data flow,and have poor clustering stability. Therefore,an incremental clustering method based on partial-priority clustering and Gaussian mixture model tree is put forward. The partial-priority clustering algo-rithm is used to perform the priority clustering for large dataset,acquire the typical dataset,and reduce the data complexity of large dataset. And then the incremental clustering algorithm based on Gaussian mixture model tree is used to insert the data in typical dataset into a Gaussian mixture model tree to construct the Gaussian mixture model tree of the dataset. The leaf nodes and none-leaf nodes of the tree are matched with single Gaussian data distribution and Gaussian mixture model distribution re-spectively. According the insertion results,the Gaussian mixture model tree is adjusted,the data inserted into the model should be deleted whether or not is detected,and data deletion is accomplished. The breadth-first method is adopted to get the best tree node as the final clustering result. The experimental results indicate that the proposed incremental clustering algorithm has per-fect clustering effect,strong expansibility,and high stability.%传统聚类算法实现大数据集聚类时,耗费大量的时间和内存,无法适应大数据流的动态性,聚类稳定性较差.因此,提出基于优先聚类和高斯混合模型树的递增聚类方法.采用优先聚类算法对大数据集进行优先聚类,获取典型数据集,降低大数据集的数据复杂度,采用高斯混合模型树的递增聚类算法,将典型数据集中的数据插入到高斯混合模型树内,塑造数据集的高斯混合模型树,树的叶子节点和非叶子节点分别同单高斯数据分布和高斯混合模型分布对应,基于插入结果对高斯混合模型树实施调整,检测插入到模型树内的数据是否需要删除,并完成数据的删除操作,采用广度优先方法获取最佳的树节点作为最终的聚类结果.实验结果表明该算法取得了很好的效果,具有较高的可扩展性和稳定性.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号