带有缺失数据的一种动态聚类方法

肖静; 骆如九; 宋雯; 汤在祥; 徐辰武

首页> 中文期刊> 《中国农业科学》 >带有缺失数据的一种动态聚类方法

带有缺失数据的一种动态聚类方法

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

[Objective] The aim of the study is to investigate a clustering method for clustering the data with missing values in practice research. [Method] The paper introduces a maximum likelihood-based dynamic clustering method, which could configure a complete data set through the maximum likelihood estimation for the missing by statistics of the others. The parameters of missing data and different clusters are estimated by the maximum likelihood method implemented via expectation-maximization (EM) algorithm and the objects are classified by the Bayesian posterior probability. [Result I The results of simulation studies show that the proposed method not only has fast convergence speed but also accurately cluster the data with missing values. [Conclusion] The proposed method was further validated by Fisher's Iris dataset. The result indicated that the proposed method had a significant advantage on clustering accuracy compared to the delete missing data arithmetic and it is similar to complete data clustering algorithm.%[目的]探讨实际问题研究中的不完全数据聚类.[方法]利用相关变量的辅助信息,对缺失数据进行推估,确定其合理的替代值,从而构造出一个“完全”数据集.在此基础上以EM算法循环迭代,参数的估计值和缺失数据的替代值都将逐渐收敛,以相应的贝叶斯后验概率判别个体的归类,进而实现动态聚类.[结果]模拟研究表明,缺值替代法具有较好的收敛性,对有缺失的数据基本都可正确地聚类.[结论]Fisher的鸢尾花花类识别数据验证了缺值替代法的可行性,其聚类的准确性高于缺值删除法,基本接近完全数据聚类.

著录项

来源
《中国农业科学》 |2012年第21期|4534-4542|共9页
作者
肖静; 骆如九; 宋雯; 汤在祥; 徐辰武;
展开▼
作者单位

南通大学公共卫生学院流行病与卫生统计学教研室;

江苏南通226019;

扬州大学江苏省作物遗传生理重点实验室;

江苏扬州225009;

扬州大学江苏省作物遗传生理重点实验室;

江苏扬州225009;

苏州大学医学部公共卫生学院流行病与卫生统计学教研室;

江苏苏州215123;

扬州大学江苏省作物遗传生理重点实验室;

江苏扬州225009;

展开▼
原文格式 PDF
正文语种 chi
中图分类
关键词
聚类分析; 缺失数据; 后验概率; 极大似然估计;

相似文献

中文文献
外文文献
专利

1. 一种带有导向性的聚类方法在电信客户细分中的应用 [J] . 黄海 ,林齐宁 . 北京邮电大学学报（社会科学版） . 2006,第001期
2. 一种带有Fuzzy聚类方法的ABC分析 [J] . 程承运 . 葛洲坝水电工程学院学报 . 1995,第003期
3. 基于k-means聚类方法的曲线按比伸缩置换缺失数据补全法 [J] . 杨亚洲 ,钱秋明 ,梁鸭红 . 电气自动化 . 2021,第002期
4. 一种基于似然极大的动态聚类方法及其应用 [J] . 肖静 ,胡治球 ,王学枫 . 作物学报 . 2007,第001期
5. 基于免疫原理的一种动态数据聚类方法 [J] . 张雷 ,李人厚 . 控制与决策 . 2007,第4期
6. 一种基于动态自适应数据窗口的模糊k-均值聚类缺失数据估算算法 [C] . 廖再飞 ,吕新杰 ,罗雄飞 . NDBC2009第26届中国数据库学术会议 . 2009
7. 多指标综合评价的非参数方法和缺失数据的聚类方法研究 [A] . 骆汝九 . 2011

带有缺失数据的一种动态聚类方法

摘要

著录项

相似文献

相关主题

期刊订阅