Published in: 《计算机应用》 (Journal of Computer Applications)

Improved Rival Penalized Competitive Learning Algorithm Based on the Spatial Distribution Density of Samples

     

Abstract

The traditional Rival Penalized Competitive Learning (RPCL) algorithm ignores the influence of a dataset's geometric structure on the adjustment of node weights. The new RPCL algorithm proposed by Wei Limei et al. (Wei Limei, Xie Weixin. A new competitive learning algorithm for clustering analysis. Journal of Electronics, 2000, 22(1): 13-18) introduces sample density into the weight adjustment, but its definition of density is subjective. To address both problems, this paper proposes an improved RPCL algorithm based on the spatial distribution density of samples: the density of each sample is defined from the natural distribution of the samples in the dataset, and this density is introduced into the adjustment of the RPCL node weights. The algorithm is tested on datasets from the UCI Machine Learning Repository and on randomly generated synthetic datasets containing noise points. The algorithms are compared on the accuracy with which they determine the number of clusters, on run time, on the clustering sum of squared errors, and on the Rand index, Jaccard coefficient and Adjusted Rand index of the clustering results. The experimental results show that the proposed algorithm outperforms both the original RPCL algorithm and the algorithm of Wei Limei et al.: it achieves better clustering results and is considerably more robust to noisy data.
The proposed algorithm not only determines a reasonable number of clusters from the natural distribution of the samples, but also finds suitable cluster centers, which improves clustering accuracy and lets the clustering result converge toward the global optimum as quickly as possible.
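For orientation, the weight adjustment that the abstract refers to can be sketched as a single online step of the classical RPCL rule: the winning node moves toward the sample and the runner-up (the rival) is pushed slightly away. This is only a minimal sketch under stated assumptions. The function name `rpcl_step`, the learning rates and the `density` argument are illustrative and do not come from the paper. In particular, the abstract does not give the paper's density definition or say how density enters the update, so `density` is a hypothetical multiplier that reproduces plain RPCL when left at 1.0.

```python
def rpcl_step(weights, wins, x, lr_win=0.05, lr_rival=0.002, density=1.0):
    """One online RPCL update for sample x; mutates weights and wins in place.

    weights: list of node weight vectors (lists of floats)
    wins:    per-node win counts, initialised to 1 so the total is never zero
    density: HYPOTHETICAL per-sample scaling factor (assumption, not the
             paper's formula); 1.0 gives the classical RPCL update
    """
    total = sum(wins)
    # Frequency-sensitive squared distance: nodes that won often are handicapped.
    scored = sorted(
        (wins[j] / total * sum((xi - wi) ** 2 for xi, wi in zip(x, w)), j)
        for j, w in enumerate(weights)
    )
    winner, rival = scored[0][1], scored[1][1]
    # The winner learns: it moves toward the sample.
    weights[winner] = [
        wi + density * lr_win * (xi - wi) for xi, wi in zip(x, weights[winner])
    ]
    # The rival is penalised: it moves away from the sample at a much smaller rate.
    weights[rival] = [
        wi - density * lr_rival * (xi - wi) for xi, wi in zip(x, weights[rival])
    ]
    wins[winner] += 1
    return winner, rival
```

In use, one would start with more nodes than the expected number of clusters, call `rpcl_step` repeatedly over shuffled samples, and read off the cluster count from the nodes that stay inside the data while the penalised extras drift away. How a stopping criterion and the density weighting are chosen in the paper is not stated in the abstract.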
