Classification performance of various real-life data sets when the features are discretized

Abstract

The Bayesian data reduction algorithm is applied to a collection of thirty real-life data sets primarily found at the University of California at Irvine's Repository of Machine Learning databases. The algorithm works by finding the best performing quantization complexity of the feature vectors, and this makes it necessary to discretize all continuous valued features. Therefore, results are given by showing the initial quantization of the continuous valued features that yields best performance. Further, the Bayesian data reduction algorithm is also compared to a conventional linear classifier, which does not discretize any feature values. In general, the Bayesian data reduction algorithm outperforms the linear classifier by obtaining a lower probability of error, as averaged over all thirty data sets.
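
The abstract does not say how the continuous-valued features are initially quantized. As a minimal sketch of what such a discretization step can look like, the following assumes equal-width binning; the function name `discretize_equal_width` and the sample values are hypothetical, and this is not the authors' Bayesian data reduction algorithm.

```python
def discretize_equal_width(values, num_levels):
    """Map each continuous value to an integer level in [0, num_levels - 1].

    Assumption: equal-width bins spanning the observed range of the feature.
    The paper may use a different initial quantization.
    """
    if num_levels < 1:
        raise ValueError("num_levels must be at least 1")
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant feature collapses to a single level.
        return [0 for _ in values]
    width = (hi - lo) / num_levels
    # The maximum value would fall just past the last bin, so clamp it.
    return [min(int((v - lo) / width), num_levels - 1) for v in values]


# Hypothetical continuous feature quantized to three levels.
feature = [0.1, 0.4, 0.35, 0.8, 1.0, 0.05]
levels = discretize_equal_width(feature, 3)
print(levels)
```

Varying `num_levels` per feature and comparing classifier error on held-out data is one way to look for the initial quantization that performs best, in the spirit of the comparison the abstract describes.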

Bibliographic record

Source: 2001, pp. 753-758 (6 pages)
Authors: Lynch, R.S., Jr.; Willett, P.K.
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunications technology

Similar literature

1. A fuzzy-based data transformation for feature extraction to increase classification performance with small medical data sets [J]. Der-Chiang Li, Chiao-Wen Liu, Susan C. Hu. Artificial Intelligence in Medicine. 2011, No. 1

2. Effects of data set features on the performances of classification algorithms [J]. Ohbyung Kwon, Jae Mun Sim. Expert Systems with Applications. 2013, No. 5

3. The Comparison of PCA and Discrete Rough Set for Feature Extraction of Remote Sensing Image Classification - A Case Study on Rice Classification, Taiwan [J]. T. C. Lei, S. Wan, T. Y. Chou. Computational Geosciences. 2008, No. 1

4. Classification performance of various real-life data sets when the features are discretized [C]. Lynch R.S. Jr., Willett P.K. IEEE International Conference on Systems, Man, and Cybernetics. 2001

5. The integration of seismic anisotropy and reservoir performance data for characterization of naturally fractured reservoirs using discrete feature network models [D]. Will, Robert. 2004

6. Peculiar Genes Selection: A new features selection method to improve classification performances in imbalanced data sets [O]. Federica Martina, Marco Beccuti, Gianfranco Balbo. 2017
