首页> 外文期刊>Advances in data analysis and classification >A robust approach to model-based classification based on trimming and constraints Semi-supervised learning in presence of outliers and label noise
【24h】

A robust approach to model-based classification based on trimming and constraints Semi-supervised learning in presence of outliers and label noise

机译:基于修剪和限制的基于模型分类的强大方法,在异常因素和标签噪声存在半监督学习

获取原文
获取原文并翻译 | 示例
       

摘要

In a standard classification framework a set of trustworthy learning data are employed to build a decision rule, with the final aim of classifying unlabelled units belonging to the test set. Therefore, unreliable labelled observations, namely outliers and data with incorrect labels, can strongly undermine the classifier performance, especially if the training size is small. The present work introduces a robust modification to the Model-Based Classification framework, employing impartial trimming and constraints on the ratio between the maximum and the minimum eigenvalue of the group scatter matrices. The proposed method effectively handles noise presence in both response and exploratory variables, providing reliable classification even when dealing with contaminated datasets. A robust information criterion is proposed for model selection. Experiments on real and simulated data, artificially adulterated, are provided to underline the benefits of the proposed method.
机译:在标准分类框架中,采用一组值得信赖的学习数据来构建决策规则,最终目标是对属于测试集的未标记单元进行分类。 因此,不可靠的标记观测,即具有错误标签的异常值和数据,可以强烈地破坏分类器性能,特别是如果训练大小很小。 本工作引入了对基于模型的分类框架的鲁棒修改,采用公正的修剪和对组散射矩阵的最大和最小特征值之间的比率的限制。 所提出的方法有效地处理响应和探索性变量中的噪声存在,即使在处理受污染的数据集时也提供可靠的分类。 提出了一种用于模型选择的强大信息标准。 提供了对实际和模拟数据的实验,人为掺假,以强调所提出的方法的益处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号