IEEE International Conference on Data Science in Cyberspace

Sparse Weighted Naive Bayes Classifier for Efficient Classification of Categorical Data



Abstract

Feature selection has become a key challenge in machine learning with the rapid growth of data size in real-world applications. However, existing feature selection methods mainly focus on numeric data, which leads to quality loss when handling classification problems involving categorical variables. In this paper, we propose an improvement of the Bayesian classifier based on sparse regression. To the best of our knowledge, this is the first attempt to extend sparse regression to directly process categorical variables. We implement the idea for the weighted naive Bayes classifier. The introduction of L1-regularized learning ensures that the algorithm retains only a minimal subset of variables for model building while achieving a near-optimal decision hyperplane, which leads to excellent performance in high-dimensional or small-sample-size settings. We carried out benchmark tests on five UCI categorical data sets, which showed that the proposed algorithm performs competitively against the original weighted naive Bayes classifier and several state-of-the-art feature selection methods, including L1-regularized logistic regression and SVM-RFE.
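The idea described in the abstract can be sketched as follows: fit the usual per-class conditional probability tables of a naive Bayes classifier for categorical features, then learn a per-feature weight on each log-likelihood term by minimizing a logistic loss with an L1 penalty, so that uninformative features are shrunk to zero. This is a minimal illustrative sketch, not the paper's algorithm; the synthetic data, the proximal-gradient optimizer, and all variable names are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical synthetic categorical data: feature 0 is informative
# (agrees with the binary label 80% of the time), features 1-2 are noise.
n = 200
y = rng.integers(0, 2, n)
X = np.empty((n, 3), dtype=int)
X[:, 0] = (y + (rng.random(n) < 0.2)) % 2   # correlated with the label
X[:, 1] = rng.integers(0, 3, n)             # noise
X[:, 2] = rng.integers(0, 2, n)             # noise

classes = np.unique(y)

# Laplace-smoothed log P(x_j = v | c) tables, one per categorical feature.
def fit_cond_logprobs(X, y):
    tables = []
    for j in range(X.shape[1]):
        vals = np.unique(X[:, j])
        t = np.zeros((len(classes), len(vals)))
        for ci, c in enumerate(classes):
            for vi, v in enumerate(vals):
                t[ci, vi] = np.sum((y == c) & (X[:, j] == v)) + 1.0
            t[ci] /= t[ci].sum()
        tables.append((vals, np.log(t)))
    return tables

tables = fit_cond_logprobs(X, y)
log_prior = np.log(np.bincount(y) / n)

# L[i, j] = log P(x_ij | c=1) - log P(x_ij | c=0); the weighted naive
# Bayes decision score is  b + sum_j w_j * L[i, j].
def loglik_diff(X):
    L = np.zeros(X.shape, dtype=float)
    for j, (vals, logt) in enumerate(tables):
        idx = np.searchsorted(vals, X[:, j])
        L[:, j] = logt[1, idx] - logt[0, idx]
    return L

L = loglik_diff(X)

# Learn feature weights w with logistic loss + L1 penalty via proximal
# gradient descent; the soft-threshold step drives noise weights to zero.
w = np.zeros(L.shape[1])
b = log_prior[1] - log_prior[0]
lam, lr = 0.05, 0.1
for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-(L @ w + b)))          # P(y=1 | x)
    grad = L.T @ (p - y) / n                        # logistic-loss gradient
    w = w - lr * grad
    w = np.sign(w) * np.maximum(np.abs(w) - lr * lam, 0.0)  # prox of L1

pred = (L @ w + b > 0).astype(int)
print("weights:", np.round(w, 3))
print("accuracy:", np.mean(pred == y))
```

With uniform weights this reduces to ordinary naive Bayes; the L1 prox step is what yields the sparse variable subset the abstract refers to.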
