Journal: IEEE Transactions on Knowledge and Data Engineering
A Correlation-Based Feature Weighting Filter for Naive Bayes

Abstract

Due to its simplicity, efficiency, and efficacy, naive Bayes (NB) remains one of the top 10 algorithms in the data mining and machine learning community. Among the numerous approaches to alleviating its conditional independence assumption, feature weighting places more emphasis on highly predictive features than on less predictive ones. In this paper, we argue that for NB, highly predictive features should be highly correlated with the class (maximum mutual relevance) yet uncorrelated with other features (minimum mutual redundancy). Based on this premise, we propose a correlation-based feature weighting (CFW) filter for NB. In CFW, the weight for a feature is a sigmoid transformation of the difference between the feature-class correlation (mutual relevance) and the average feature-feature intercorrelation (average mutual redundancy). Experimental results show that NB with CFW significantly outperforms NB and all the other state-of-the-art feature weighting filters used for comparison. Compared to feature weighting wrappers for improving NB, the main advantages of CFW are its low computational complexity (no search is involved) and the fact that it maintains the simplicity of the final model. We also apply CFW to text classification and achieve remarkable improvements.
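
The weighting rule described in the abstract can be illustrated with a minimal sketch. The abstract does not say which correlation measure is used or whether it is normalized, so this sketch assumes plain empirical mutual information between discrete variables for both the feature-class and the feature-feature terms. The function names `mutual_information` and `cfw_weights` are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Empirical mutual information (in nats) between two discrete sequences.

    Assumption: stands in for the 'correlation' measure, which the
    abstract does not specify.
    """
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    # Sum over observed pairs of p(x, y) * log(p(x, y) / (p(x) * p(y))).
    return sum(
        (c / n) * math.log(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def cfw_weights(columns, labels):
    """Sigmoid of (feature-class relevance - average feature-feature redundancy).

    `columns` is a list of feature columns, each a sequence of discrete values
    aligned with `labels`.
    """
    m = len(columns)
    weights = []
    for i, col in enumerate(columns):
        relevance = mutual_information(col, labels)
        others = [mutual_information(col, columns[j]) for j in range(m) if j != i]
        redundancy = sum(others) / len(others) if others else 0.0
        weights.append(1.0 / (1.0 + math.exp(-(relevance - redundancy))))
    return weights


# Toy data: `a` predicts the class perfectly, `b` is independent of it,
# and `a_copy` duplicates `a` (fully redundant with it).
labels = [0, 0, 1, 1]
a = [0, 0, 1, 1]
b = [0, 1, 0, 1]
a_copy = list(a)

print(cfw_weights([a, b], labels))
print(cfw_weights([a, b, a_copy], labels))
```

On this toy data, adding the redundant copy lowers the weight of `a` while the uninformative feature `b` stays at 0.5. In feature-weighted NB such weights are commonly used as exponents on the per-feature conditional probabilities; that usage is a general convention assumed here, since the abstract does not spell it out.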