首页> 外文会议> >Classification performance of various real-life data sets when the features are discretized
【24h】

Classification performance of various real-life data sets when the features are discretized

机译:离散化特征后各种现实数据集的分类性能

获取原文

摘要

The Bayesian data reduction algorithm is applied to a collection of thirty real-life data sets primarily found at the University of California at Irvine's Repository of Machine Learning databases. The algorithm works by finding the best performing quantization complexity of the feature vectors, and this makes it necessary to discretize all continuous valued features. Therefore, results are given by showing the initial quantization of the continuous valued features that yields best performance. Further, the Bayesian data reduction algorithm is also compared to a conventional linear classifier, which does not discretize any feature values. In general, the Bayesian data reduction algorithm outperforms the linear classifier by obtaining a lower probability of error, as averaged over all thirty data sets.
机译:贝叶斯数据约简算法应用于三十个现实生活中的数据集的集合,这些数据集主要在加利福尼亚大学尔湾分校的机器学习数据库中找到。该算法通过找到特征向量的最佳执行量化复杂度来工作,因此有必要离散化所有连续值的特征。因此,通过显示产生最佳性能的连续值特征的初始量化来给出结果。此外,还将贝叶斯数据约简算法与不离散任何特征值的常规线性分类器进行比较。通常,贝叶斯数据约简算法通过获得较低的错误概率(优于所有三十个数据集的平均值),胜过线性分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号