首页> 外文会议>Australian Joint Conference on Artificial Intelligence; 20041204-06; Cairns(AU) >Naive Bayes Classifiers That Perform Well with Continuous Variables
【24h】

Naive Bayes Classifiers That Perform Well with Continuous Variables

机译:朴素贝叶斯分类器在连续变量方面表现良好

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

There are three main methods for handling continuous variables in naive Bayes classifiers, namely, the normal method (parametric approach), the kernel method (non parametric approach) and discretization. In this article, we perform a methodologically sound comparison of the three methods, which shows large mutual differences of each of the methods and no single method being universally better. This suggests that a method for selecting one of the three approaches to continuous variables could improve overall performance of the naive Bayes classifier. We present three methods that can be implemented efficiently υ-fold cross validation for the normal, kernel and discretization method. Empirical evidence suggests that selection using 10 fold cross validation (especially when repeated 10 times) can largely and significantly improve over all performance of naive Bayes classifiers and consistently outperform any of the three popular methods for dealing with continuous variables on their own. This is remarkable, since selection among more classifiers does not consistently result in better accuracy.
机译:在朴素贝叶斯分类器中,有三种处理连续变量的主要方法,即普通方法(参数方法),核方法(非参数方法)和离散化。在本文中,我们对这三种方法进行了方法论上的合理比较,这显示了每种方法之间的巨大差异,没有任何一种方法能普遍地更好。这表明选择三种连续变量方法之一的方法可以提高朴素贝叶斯分类器的整体性能。我们介绍了三种可以有效地实现常规,内核和离散化方法的υ折叠交叉验证的方法。经验证据表明,使用10倍交叉验证(特别是重复10次)的选择可以在很大程度上改善朴素贝叶斯分类器的所有性能,并且始终优于三种自行处理连续变量的流行方法中的任何一种。这是惊人的,因为在更多分类器中进行选择并不能始终带来更高的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号