首页> 外文OA文献 >A graph-based semi-supervised k nearest-neighbor method for nonlinear manifold distributed data classification
【2h】

A graph-based semi-supervised k nearest-neighbor method for nonlinear manifold distributed data classification

机译:基于图的半监督k近邻法非线性流形分布式数据分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

k nearest neighbors (kNN) is one of the most widely used supervised learning algorithms to classify Gaussian distributed data, but it does not achieve good results when it is applied to nonlinear manifold distributed data, especially when a very limited amount of labeled samples are available. In this paper, we propose a new graph-based kNN algorithm which can effectively handle both Gaussian distributed data and nonlinear manifold distributed data. To achieve this goal, we first propose a constrained Tired Random Walk (TRW) by constructing an R-level nearest-neighbor strengthened tree over the graph, and then compute a TRW matrix for similarity measurement purposes. After this, the nearest neighbors are identified according to the TRW matrix and the class label of a query point is determined by the sum of all the TRW weights of its nearest neighbors. To deal with online situations, we also propose a new algorithm to handle sequential samples based a local neighborhood reconstruction. Comparison experiments are conducted on both synthetic data sets and real-world data sets to demonstrate the validity of the proposed new kNN algorithm and its improvements to other version of kNN algorithms. Given the widespread appearance of manifold structures in real-world problems and the popularity of the traditional kNN algorithm, the proposed manifold version kNN shows promising potential for classifying manifold-distributed data.
机译:k最近邻(kNN)是用于对高斯分布数据进行分类的最广泛使用的监督学习算法之一,但是将其应用于非线性流形分布数据时,尤其是在数量非常有限的标记样本可用时,其效果不佳。 。在本文中,我们提出了一种新的基于图的kNN算法,该算法可以有效处理高斯分布数据和非线性流形分布数据。为了实现此目标,我们首先通过在图上构造R级最近邻居增强树来提出约束疲倦的随机游走(TRW),然后为相似性测量目的计算TRW矩阵。此后,根据TRW矩阵标识最近的邻居,并根据查询点的所有TRW权重之和确定查询点的类别标签。为了处理在线情况,我们还提出了一种基于局部邻域重构的新算法来处理顺序样本。在合成数据集和实际数据集上都进行了比较实验,以证明所提出的新kNN算法的有效性及其对其他版本kNN算法的改进。鉴于流形结构在实际问题中的广泛出现以及传统kNN算法的普及,提出的流形版本kNN显示了对流形分布数据进行分类的有希望的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号