一种适用于高维非线性特征数据的聚类算法及应用

姜洪权; 王岗; 高建民; 高智勇; 高瑞琪; 郭旗

首页> 中文期刊>西安交通大学学报 >一种适用于高维非线性特征数据的聚类算法及应用

一种适用于高维非线性特征数据的聚类算法及应用

开具论文收录证明 >>

期刊封面封底目录下载 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Aiming at the problems caused by the nonlinear relations between the attributes of high dimensional data in cluster analysis,such as uneven distribution of data,invalidation of traditional similarity measures and difficulty of accurate representation of the result class,a clustering algorithm for high dimensional nonlinear feature data is proposed based on kernel principal component analysis (KPCA) and density clustering (DBSCAN).To extract the nonlinear characteristics of high dimensional data,the KPCA theory is adopted to map the original to a higher dimensional data space,thus a set of directions in principal component spacePCS for extracting the nonlinear characteristics of data and reduced dimensions can be obtained.The similarity distance of data in PCS is defined to improve the traditional DBSCAN clustering algorithm and 3δ statistical theory is used to characterize the clustering results.A case of hypertension group clustering is provided to illustrate the feasibility of the proposed method,and the results show that the proposed method can effectively obtain the nonlinear characteristics of the high dimensional data and realize cluster analysis and cluster center knowledge expression to solve the difficulties in the traditional DBSCAN clustering method for cluster analysis of high dimensional data.%针对高维数据聚类分析中数据之间具有多种非线性特征关系,导致数据分布不均、传统相似性度量失效及结果类中心难以精准表征等问题,提出了一种基于核主元分析(KPCA)与密度聚类(DBSCAN)的高维非线性特征数据聚类分析技术.首先,为有效提取高维数据的非线性特征,利用KPCA理论将原始数据映射到更高维数据空间,利用主元分析获得数据变化的方向集合,并进行降维分析;然后,通过重新定义数据样本在主元空间的相似性距离对传统DBSCAN聚类方法进行改进,并利用3δ统计理论对各簇中心的进行表征,从而实现高维数据的精确分类与类中心知识表达.以实际高血压患者群体聚类问题为例对方法进行了有效性验证,实验表明,所提方法可以有效获取原始数据的非线性特征,实现患者个体特征群体的有效划分及簇类中心知识的表达,解决传统DBSCAN聚类方法对高维数据不适用的问题.

著录项

来源
《西安交通大学学报》|2017年第12期|49-55,90|共8页
作者
姜洪权; 王岗; 高建民; 高智勇; 高瑞琪; 郭旗;
展开▼
作者单位

西安交通大学机械制造系统工程国家重点实验室,710049,西安;

西安交通大学第二附属医院,710004,西安;

西安交通大学机械制造系统工程国家重点实验室,710049,西安;

西安交通大学机械制造系统工程国家重点实验室,710049,西安;

西安交通大学机械制造系统工程国家重点实验室,710049,西安;

西安交通大学第二附属医院,710004,西安;

展开▼
原文格式 PDF
正文语种 chi
中图分类检索机;
关键词
非线性; 高维数据; 核主元分析; 密度聚类;
入库时间 2023-07-25 11:08:17

相似文献

中文文献
外文文献
专利

1. 一种适合于非线性高维数据的谱聚类算法 [J] . 王鸿菲 ,杜洪波 ,林凯迪 . 计算机应用与软件 . 2021,第009期
2. 一种高维聚类算法及在洗钱侦测中的应用 [J] . 陈云开 ,卢正鼎 ,刘芳 . 计算机科学 . 2007,第006期
3. 一种结合贪心选择和特征加权的高维数据聚类算法 [J] . 向志华 ,邵亚丽 . 电子科技 . 2019,第011期
4. 一种改进的GP-CLIQUE自适应高维子空间聚类算法 [J] . 肖红光 ,谭雯 ,邓国群 . 测控技术 . 2018,第004期
5. 一种改进的SUBCLU高维子空间聚类算法 [J] . 罗靖 ,钱雪忠 ,韩利钊 . 计算机工程与应用 . 2017,第014期
6. 一种高维空间数据的模糊聚类算法 [C] . 杨悦 ,张健沛 ,李忠伟 . 第十六届中国神经网络大会(CNNC2006)暨首届中国人工免疫系统专题会议(CAISC06) . 2006
7. 在聚类中关于噪音与高维问题的研究——一种快速鲁棒的映射聚类算法的研究及应用 [A] . 周霆 . 2006

一种适用于高维非线性特征数据的聚类算法及应用

摘要

著录项

相似文献

相关主题

期刊订阅