Supervised and Unsupervised Discretization of Continuous Features

Abstract

Many supervised machine learning algorithms require a discrete feature space. In this paper, we review previous work on continuous feature discretization, identify defining characteristics of the methods, and conduct an empirical evaluation of several methods. We compare binning, an unsupervised discretization method, to entropy-based and purity-based methods, which are supervised algorithms. We found that the performance of the Naive-Bayes algorithm significantly improved when features were discretized using an entropy-based method. In fact, over the 16 tested datasets, the discretized version of Naive-Bayes slightly outperformed C4.5 on average. We also show that in some cases, the performance of the C4.5 induction algorithm significantly improved if features were discretized in advance; in our experiments, the performance never significantly degraded, an interesting phenomenon considering that C4.5 is capable of locally discretizing features.
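The contrast the abstract draws between unsupervised binning and supervised entropy-based discretization can be illustrated with a minimal Python sketch. It is not the procedure evaluated in the paper: the function names (`equal_width_bins`, `class_entropy`, `best_entropy_cut`) and the toy data are assumptions made for illustration, and the supervised part finds only a single binary cut point, leaving out any recursive splitting or stopping criterion.

```python
import math
from collections import Counter


def equal_width_bins(values, k):
    """Unsupervised: map each value to one of k equal-width intervals (labels unused)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / k
    if width == 0:
        return [0 for _ in values]
    # The maximum value would fall into bin k, so clamp it into the last bin.
    return [min(int((v - lo) / width), k - 1) for v in values]


def class_entropy(labels):
    """Shannon entropy, in bits, of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def best_entropy_cut(values, labels):
    """Supervised: cut point minimizing the size-weighted class entropy of both sides."""
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    best_cut, best_score = None, float("inf")
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no boundary can separate two identical values
        left = [lab for _, lab in pairs[:i]]
        right = [lab for _, lab in pairs[i:]]
        score = (len(left) * class_entropy(left)
                 + len(right) * class_entropy(right)) / n
        if score < best_score:
            best_cut = (pairs[i - 1][0] + pairs[i][0]) / 2
            best_score = score
    return best_cut, best_score


# Hypothetical toy data: one continuous feature and a binary class label.
values = [1.0, 2.0, 3.0, 4.0, 10.0, 11.0]
labels = ["a", "a", "a", "b", "b", "b"]

print(equal_width_bins(values, 2))       # boundary at 6.0, ignores the labels
print(best_entropy_cut(values, labels))  # boundary at 3.5, separates the classes
```

On this toy data the equal-width boundary falls at 6.0 and mixes the two classes in the first bin, while the entropy-minimizing boundary at 3.5 separates them cleanly. That is the kind of difference between ignoring and using class labels that the abstract refers to.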

Bibliographic record

Source
Machine Learning (ML95) | 1995 | pp. 194-202 | 9 pages
Conference location: Tahoe City, CA (US)
Authors
James Dougherty; Ron Kohavi; Mehran Sahami
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology

Similar literature

1. Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection [J]. Jun Chin Ang, Andri Mirzal, Habibollah Haron. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2016, Issue 5
2. Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios [J]. S. Y. Kung, Yuhui Luo, Man-Wai Mak. Journal of Signal Processing Systems. 2010, Issue 1
3. Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios [J]. S. Y. Kung, Yuhui Luo, Man-Wai Mak. Journal of Signal Processing Systems for Signal, Image, and Video Technology. 2010, Issue 1
4. Supervised and Unsupervised Discretization of Continuous Features [C]. James Dougherty, Ron Kohavi, Mehran Sahami. International Conference on Machine Learning. 1995
5. Feature selection in supervised and unsupervised learning via evolutionary search [D]. Kim, YongSeog. 2001
6. Unsupervised Few-Shot Feature Learning via Self-Supervised Training [O]. Zilong Ji, Xiaolong Zou, Tiejun Huang. 2020
7. Supervised and unsupervised discretization of continuous features [O]. James Dougherty, Ron Kohavi, Mehran Sahami. 1995
8. Supervised and Unsupervised Discretization Methods for Evolutionary Algorithms [R]. Cantu-Paz, E. 2001