首页> 美国卫生研究院文献>BioMed Research International >K Important Neighbors: A Novel Approach to Binary Classification in High Dimensional Data
【2h】

K Important Neighbors: A Novel Approach to Binary Classification in High Dimensional Data

机译:K个重要邻居:高维数据中二进制分类的一种新方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

K nearest neighbors (KNN) are known as one of the simplest nonparametric classifiers but in high dimensional setting accuracy of KNN are affected by nuisance features. In this study, we proposed the K important neighbors (KIN) as a novel approach for binary classification in high dimensional problems. To avoid the curse of dimensionality, we implemented smoothly clipped absolute deviation (SCAD) logistic regression at the initial stage and considered the importance of each feature in construction of dissimilarity measure with imposing features contribution as a function of SCAD coefficients on Euclidean distance. The nature of this hybrid dissimilarity measure, which combines information of both features and distances, enjoys all good properties of SCAD penalized regression and KNN simultaneously. In comparison to KNN, simulation studies showed that KIN has a good performance in terms of both accuracy and dimension reduction. The proposed approach was found to be capable of eliminating nearly all of the noninformative features because of utilizing oracle property of SCAD penalized regression in the construction of dissimilarity measure. In very sparse settings, KIN also outperforms support vector machine (SVM) and random forest (RF) as the best classifiers.
机译:K最近邻(KNN)是最简单的非参数分类器之一,但在高维设置中,KNN的准确性受干扰特征影响。在这项研究中,我们提出了K重要邻居(KIN)作为高维问题中二进制分类的一种新方法。为避免维数的诅咒,我们在初始阶段实施了平滑修剪的绝对偏差(SCAD)logistic回归,并考虑了在构建相异度度量中每个特征的重要性,并将特征贡献作为SCAD系数对欧几里德距离的函数。这种混合不相似性度量的性质结合了特征和距离的信息,同时具有SCAD惩罚回归和KNN的所有良好特性。与KNN相比,仿真研究表明KIN在准确性和降维方面均具有良好的性能。由于在相异性度量的构建中利用了SCAD罚回归的预言性质,因此该提议的方法能够消除几乎所有非信息性特征。在非常稀疏的环境中,KIN的最佳分类器也胜过支持向量机(SVM)和随机森林(RF)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号