...
【24h】

On nearest neighbor classification using adaptive choice of k

机译:关于使用k的自适应选择的最近邻分类

获取原文
获取原文并翻译 | 示例
           

摘要

Nearest neighbor classification is one of the simplest and popular methods for statistical pattern recognition. It classifies an observation x to the class, which is the most frequent in the neighborhood of x. The size of this neighborhood is usually determined by a predefined parameter k. Normally, one uses cross-validation techniques to estimate the optimum value of this parameter, and that estimated value is used for classifying all observations. However, in classification problems, in addition to depending on the training sample, a good choice of k depends on the specific observation to be classified. Therefore, instead of using a fixed value of k over the entire measurement space, a spatially adaptive choice of k may be more useful in practice. This article presents one such adaptive nearest neighbor classification technique, where the value of k is selected depending on the distribution of competing classes in the vicinity of the observation to be classified. The utility of the proposed method has been illustrated using some simulated examples and well-known benchmark datasets. Asymptotic optimality of its misclassification rate has been derived under appropriate regularity conditions.
机译:最近邻分类是用于统计模式识别的最简单和流行的方法之一。它将观测值x归类为类别,该类别是x附近最常见的。该邻域的大小通常由预定义参数k确定。通常,人们使用交叉验证技术来估计该参数的最佳值,并且该估计值用于对所有观测值进行分类。但是,在分类问题中,除了取决于训练样本外,k的好选择还取决于要分类的特定观察值。因此,代替在整个测量空间上使用固定的k值,在实践中,k的空间自适应选择可能更有用。本文介绍了一种此类自适应最近邻分类技术,其中,根据要分类的观测值附近竞争类的分布来选择k的值。已使用一些模拟示例和众所周知的基准数据集说明了该方法的实用性。在适当的规律性条件下,得出了误分类率的渐近最优性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号