【24h】

Decontamination of Training Samples for Supervised Pattern Recognition Methods

机译:用于监督模式识别方法的训练样本的去污

获取原文
获取原文并翻译 | 示例

摘要

The present work discusses what have been called 'imperfectly supervised situations': pattern recognition applications where the assumption of label correctness does not hold for all the elements of the training sample. A methodology for contending with these practical situations and to avoid their negative impact on the performance of supervised methods is presented. This methodology can be regarded as a cleaning process removing some suspicious instances of the training sample or correcting the class labels of some others while retaining them. It has been conceived for doing classification with the Nearest Neighbor rule, a supervised nonparametric classifier that combines conceptual simplicity and an asymptotic error rate bounded in terms of the optimal Bayes error. However, initial experiments concerning the learning phase of a Multilayer Perceptron (not reported in the present work) seem to indicate a broader applicability. Results with both simulated and real data sets are presented to support the methodology and to clarify the ideas behind it. Related works are briefly reviewed and some issues deserving further research are also exposed.
机译:本工作讨论了所谓的“不完全监督情况”:模式识别应用程序,其中标签正确性的假设并不适用于训练样本的所有元素。提出了一种应对这些实际情况并避免其对有监督方法的性能产生负面影响的方法。这种方法可以看作是清除过程,以除去训练样本的一些可疑实例或在保留它们的同时更正其他一些样本的类别标签。它被认为是用最近邻规则进行分类的,该规则是有监督的非参数分类器,其结合了概念上的简单性和以最佳贝叶斯误差为界的渐近误差率。但是,有关多层感知器学习阶段的初步实验(当前工作中未报道)似乎表明了更广泛的适用性。给出了包含模拟和真实数据集的结果,以支持该方法并阐明其背后的思想。简要回顾了相关工作,并揭露了一些值得进一步研究的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号