首页> 外文期刊>Journal of classification >Robustification of Gaussian Bayes Classifier by the Minimum -Divergence Method
【24h】

Robustification of Gaussian Bayes Classifier by the Minimum -Divergence Method

机译:高斯贝叶斯分类器的最低方法稳健 - 方法

获取原文
获取原文并翻译 | 示例
           

摘要

The goal of classification is to classify new objects into one of the several known populations. A common problem in most of the existing classifiers is that they are very much sensitive to outliers. To overcome this problem, several author's attempt to robustify some classifiers including Gaussian Bayes classifiers based on robust estimation of mean vectors and covariance matrices. However, these type of robust classifiers work well when only training datasets are contaminated by outliers. They produce misleading results like the traditional classifiers when the test data vectors are contaminated by outliers as well. Most of them also show weak performance if we gradually increase the number of variables in the dataset by fixing the sample size. As the remedies of these problems, an attempt is made to propose a highly robust Gaussian Bayes classifiers by the minimum -divergence method. The performance of the proposed method depends on the value of tuning parameter , initialization of Gaussian parameters, detection of outlying test vectors, and detection of their variable-wise outlying components. We have discussed some techniques in this paper to improve the performance of the proposed method by tackling these issues. The proposed classifier reduces to the MLE-based Gaussian Bayes classifier when 0. The performance of the proposed method is investigated using both synthetic and real datasets. It is observed that the proposed method improves the performance over the traditional and other robust linear classifiers in presence of outliers. Otherwise, it keeps equal performance.
机译:分类的目标是将新对象分类为几个已知人群中的一个。大多数现有分类器中的常见问题是它们对异常值非常敏感。为了克服这个问题,一些作者试图根据均高估计平均向量和协方差矩阵,强制一些分类器,包括高斯贝叶斯分类器。然而,当只有训练数据集被异常值污染时,这些类型的稳健分类器工作很好。当测试数据向量也被异常值污染时,它们会产生传统分类器的误导性结果。如果我们通过修复示例大小逐渐增加数据集中的变量数量,它们中的大多数也会显示出薄弱的性能。作为这些问题的补救措施,尝试通过最小的方法提出高度强大的高斯贝叶斯分类器。所提出的方法的性能取决于调谐参数的值,高斯参数的初始化,偏远测试向量的检测,以及检测其可变明智的偏远组件。我们已经讨论了本文中的一些技术,以通过解决这些问题来提高所提出的方法的性能。当0时,所提出的分类器减少到基于MLE的高斯贝贝斯分类器。使用合成和实时数据集调查所提出的方法的性能。观察到所提出的方法在存在异常值的情况下提高了传统和其他强大的线性分类器的性能。否则,它会保持平等的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号