International Symposium on Knowledge and Systems Sciences

THE IMPROVEMENT OF NAIVE BAYESIAN CLASSIFIER BASED ON THE STRATEGY OF FEATURE SELECTION AND SAMPLE CLEANING


Abstract

The Naive Bayesian Classifier (NBC) is a simple and effective classification model. Although it offers advantages over many other classifiers, it does not always yield satisfactory results. In this paper, we summarize previous improvement methods for the NBC model and then propose three improvement strategies: a feature selection strategy, a sample cleaning strategy, and a mixed strategy. The first method reduces the dimensionality of the dataset by choosing an optimized feature subset according to the feature important factor (FIF) of each feature; the second method deletes noisy samples from the training dataset according to the sample polluting factor; the third method combines the two, applying feature selection first and then sample cleaning. Experimental comparison and analysis on datasets from the UCI repository show that these strategies are effective. On average, the first method raises prediction accuracy by 2.30% while keeping only 36.76% of the features in the original feature set, and the second method raises prediction accuracy by 1.59% while keeping 92.57% of the samples in the training dataset. The third method increases prediction accuracy by 2.55%. Among these strategies, the mixed strategy outperforms the other two, reducing the complexity of the model while increasing the prediction accuracy of the NBC model.
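The abstract describes the three strategies without giving the formulas for the feature important factor (FIF) or the sample polluting factor. The Python sketch below only illustrates the overall pipeline under stated assumptions: mutual information stands in for the FIF, cross-validated misclassification stands in for the sample polluting factor, and the helper names (select_features, clean_samples, mixed_strategy) and keep ratios are hypothetical illustrations, not the paper's definitions.

# A minimal sketch of the three strategies, assuming scikit-learn.
# Stand-ins (assumptions): mutual information for the FIF,
# cross-validated misclassification for the sample polluting factor.
import numpy as np
from sklearn.naive_bayes import GaussianNB
from sklearn.feature_selection import mutual_info_classif
from sklearn.model_selection import cross_val_predict

def select_features(X, y, keep_ratio=0.37):
    # Strategy 1: keep the highest-scoring fraction of features
    # (keep_ratio is illustrative, roughly matching the paper's 36.76% average).
    scores = mutual_info_classif(X, y, random_state=0)
    k = max(1, int(round(keep_ratio * X.shape[1])))
    return np.argsort(scores)[::-1][:k]

def clean_samples(X, y):
    # Strategy 2: drop training samples that a cross-validated NBC
    # misclassifies, treating them as noisy.
    preds = cross_val_predict(GaussianNB(), X, y, cv=5)
    return np.where(preds == y)[0]

def mixed_strategy(X, y):
    # Strategy 3: feature selection first, then sample cleaning,
    # then train the final NBC on the reduced data.
    cols = select_features(X, y)
    rows = clean_samples(X[:, cols], y)
    model = GaussianNB().fit(X[np.ix_(rows, cols)], y[rows])
    return model, cols

# Usage on a UCI-style dataset bundled with scikit-learn:
from sklearn.datasets import load_wine
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

X, y = load_wine(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
model, cols = mixed_strategy(X_tr, y_tr)
print(accuracy_score(y_te, model.predict(X_te[:, cols])))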