基于密度的K-means算法在轨迹数据聚类中的优化

郝美薇; 戴华林; 郝琨

首页> 中文期刊> 《计算机应用》 >基于密度的K-means算法在轨迹数据聚类中的优化

基于密度的K-means算法在轨迹数据聚类中的优化

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

针对传统的K-means算法无法预先明确聚类数目,对初始聚类中心选取敏感且易受离群孤点影响导致聚类结果稳定性和准确性欠佳的问题,提出一种改进的基于密度的K-means算法.该算法首先基于轨迹数据分布密度和增加轨迹数据关键点密度权值的方式选取高密度的轨迹数据点作为初始聚类中心进行K-means聚类,然后结合聚类有效函数类内类外划分指标对聚类结果进行评价,最后根据评价确定最佳聚类数目和最优聚类划分.理论研究与实验结果表明,该算法能够更好地提取轨迹关键点,保留关键路径信息,且与传统的K-means算法相比,聚类准确性提高了28个百分点,与具有噪声的基于密度的聚类算法相比,聚类准确性提高了17个百分点.所提算法在轨迹数据聚类中具有更好的稳定性和准确性.%Since the traditional K-means algorithm can hardly predefine the number of clusters,and performs sensitively to the initial clustering centers and outliers,which may result in unstable and inaccurate results,an improved density-based K-means algorithm was proposed.Firstly,high-density trajectory data points were selected as the initial clustering centers to perform K-means clustering by considering the density of the trajectory data distribution and increasing the weight of the density of important points.Secondly,the clustering results were evaluated by the Between-Within Proportion (BWP) index of cluster validity function.Finally,the optimal number of clusters and clustering were determined according to the clustering results evaluation.Theoretical researches and experimental results show that the improved algorithm can be better at extracting the trajectory key points and keeping the key path information.The accuracy of clustering results was 28 percentage points higher than that of the traditional K-means algorithm and 17 percentage points higher than that of the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm.The proposed algorithm has a better stability and a higher accuracy in trajectory data clustering.

著录项

来源
《计算机应用》 |2017年第10期|2946-2951|共6页
作者
郝美薇; 戴华林; 郝琨;
展开▼
作者单位

天津城建大学计算机与信息工程学院;

天津300384;

天津城建大学计算机与信息工程学院;

天津300384;

天津城建大学计算机与信息工程学院;

天津300384;

展开▼
原文格式 PDF
正文语种 chi
中图分类算法理论;
关键词
K-means算法; 基于密度; 车辆活动特征; 密度权值; 初始聚类中心; 类内类外划分指标;

相似文献

中文文献
外文文献
专利

1. 基于大数据聚类算法K-means的用户分群 [J] . 苏进 . 数字通信世界 . 2021,第006期
2. 基于密度优化初始聚类中心的K-means算法 [J] . 王艳娥 ,安健 ,梁艳 . 计算机技术与发展 . 2020,第012期
3. 基于密度峰值优化初始中心K-means算法在风力发电系统的故障诊断分析 [J] . 叶永恩 ,王欣 ,黄浩 . 新型工业化 . 2017,第010期
4. 基于密度峰值优化的K-means文本聚类算法 [J] . 田诗宵 ,丁立新 ,郑金秋 . 计算机工程与设计 . 2017,第004期
5. 基于密度的优化初始聚类中心K-means算法研究 [J] . 何佳知 ,谢颖华 . 微型机与应用 . 2015,第019期
6. 基于密度和加权欧氏距离的K-means算法 [C] . ZHAO Yan-ping ,赵延平 ,GE Jing . 中国计算机用户协会网络应用分会2013年第十七届网络新技术与应用年会 . 2013
7. K-means算法的改进及其在文本数据聚类中的应用 [A] . 王玉洁 . 2016

基于密度的K-means算法在轨迹数据聚类中的优化

摘要

著录项

相似文献

相关主题

期刊订阅