引入信息熵的CURE聚类算法

伍恒; 李文杰; 蒋旻

首页> 中文期刊>计算机应用研究 >引入信息熵的CURE聚类算法

引入信息熵的CURE聚类算法

开具论文收录证明 >>

期刊封面封底目录下载 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

为了提高传统CURE(clustering using representatives)聚类算法的质量,引入信息熵对其进行改进.该算法使用K-means算法对样本数据集进行预聚类;采用基于信息熵的相似性度量,利用簇中元素提供的信息度量不同簇之间的相互关系,并描述数据的分布;在高、低层聚类阶段,采取不同的选取策略,分别选取相应的代表点.在UCI和人造数据集上的实验结果表明,提出的算法在一定程度上提高了聚类的准确率,且在大型数据集上比传统CURE算法有着更高的聚类效率.%In order to improve the clustering quality of the traditional CURE algorithm, this paper proposed a modified CURE algorithm based on entropy.Firstly, this algorithm adopted K-means algorithm to cluster the sample data sets.Then, it introduced a similarity metric based on entropy to measure the relationship between clusters, this metric gathered information contained in the elements of the data sets, also described the distribution.Finally, in the low and high level of the clustering stage, it employed different strategies on representative points selection.The results of experiments on UCI data sets and synthetic data sets indicate that the proposed algorithm achieves better precision to some extent, and it gets better efficiency than the original CURE algorithm on large data sets.

著录项

来源
《计算机应用研究》|2017年第8期|2303-2305|共3页
作者
伍恒; 李文杰; 蒋旻;
展开▼
作者单位

武汉科技大学计算机科学与技术学院,武汉 430065;

武汉科技大学智能信息处理与实时工业系统湖北省重点实验室,武汉 430065;

武汉科技大学计算机科学与技术学院,武汉 430065;

武汉科技大学智能信息处理与实时工业系统湖北省重点实验室,武汉 430065;

武汉科技大学计算机科学与技术学院,武汉 430065;

武汉科技大学智能信息处理与实时工业系统湖北省重点实验室,武汉 430065;

展开▼
原文格式 PDF
正文语种 chi
中图分类算法理论;
关键词
层次聚类; CURE算法; 信息熵; 代表点选取;
入库时间 2022-08-18 05:02:15

相似文献

中文文献
外文文献
专利

1. 基于CURE聚类算法改进的原型选择算法 [J] . 孙元元 ,张德生 ,张晓 . 计算机系统应用 . 2019,第008期
2. 基于CURE聚类算法的科技情报异常数据检测 [J] . 柳兆峰 ,杨奇 ,霍永华 . 无线电通信技术 . 2018,第006期
3. 基于CURE聚类算法的静态R树构建方法 [J] . 李松 ,崔环宇 ,张丽平 . 计算机科学 . 2015,第010期
4. 基于CURE的用户聚类算法研究 [J] . 赵妍 ,赵学民 . 计算机工程与应用 . 2012,第011期
5. 基于改进CURE聚类算法的无监督异常检测方法 [J] . 周亚建 ,徐晨 ,李继国 . 通信学报 . 2010,第007期
6. 一种基于信息熵的空间聚类算法 [C] . 郑燕玲 . 2011全国开放式分布与并行计算学术年会 . 2011
7. 基于信息熵定义属性权重的混合数据聚类算法研究 [A] . 崔文秀 . 2021

引入信息熵的CURE聚类算法

摘要

著录项

相似文献

相关主题

期刊订阅