首页> 中文期刊> 《计算机应用》 >基于随机矩阵理论的高维数据特征选择方法

基于随机矩阵理论的高维数据特征选择方法

         

摘要

传统特征选择方法多是通过相关度量来去除冗余特征,并没有考虑到高维相关矩阵中会存在大量的噪声,严重地影响特征选择结果.为解决此问题,提出基于随机矩阵理论(RMT)的特征选择方法.首先,将相关矩阵中符合随机矩阵预测的奇异值去除,从而得到去噪后的相关矩阵和选择特征的数量;然后,对去噪后的相关矩阵进行奇异值分解,通过分解矩阵获得特征与类的相关性;最后,根据特征与类的相关性和特征之间冗余性完成特征选择.此外,还提出一种特征选择优化方法,通过依次将每一个特征设为随机变量,比较其奇异值向量与原始奇异值向量的差异来进一步优化结果.分类实验结果表明所提方法能够有效提高分类准确率,减小训练数据规模.%The traditional feature selection methods always remove redundant features by using correlation measures,and it is not considered that there is a large amount of noise in a high-dimensional correlation matrix,which seriously affects the feature selection result.In order to solve the problem,a feature selection method based on Random Matrix Theory (RMT) was proposed.Firstly,the singular values of a correlation matrix which met the random matrix prediction were removed,thereby the denoised correlation matrix and the number of selected features were obtained.Then,the singular value decomposition was performed on the denoised correlation matrix,and the correlation between feature and class was obtained by decomposed matrix.Finally,the feature selection was accomplished according to the correlation between feature and class and the redundancy between features.In addition,a feature selection optimization method was proposed,which furtherly optimize the result by comparing the difference between singular value vector and original singular value vector and setting each feature as a random variable in turn.The classification experimental results show that the proposed method can effectively improve the classification accuracy and reduce the training data scale.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号