A New Instance-weighting Naive Bayes Text Classifiers

Abstract

Recent research shows that naive Bayes text classifiers achieve noticeable classification performance despite their strong assumption of conditional independence among features. There are generally three ways to weaken this unrealistic assumption and improve classification accuracy: structure manipulation, feature manipulation, and instance manipulation. Instance manipulation can be further divided into instance weighting and instance selection. In this paper, we propose a new instance-weighting approach to the naive Bayes text classifier. In this approach, the training dataset is first divided into subsets according to class value. Every training instance in a subset is then weighted according to the distance between it and the mean of that subset. Experimental results on 15 text document datasets show that, in terms of classification accuracy, our method outperforms three existing naive Bayes text classifiers.
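The two steps described in the abstract (split the training set by class, then weight each instance by its distance to the class mean) can be illustrated with a minimal sketch. The abstract does not state the distance measure, the mapping from distance to weight, or how the weights enter the classifier, so everything below is an assumption made for illustration: Euclidean distance, the weight `1 / (1 + distance)`, a multinomial naive Bayes model with weighted word counts, and the hypothetical function names `class_mean_weights`, `fit_weighted_mnb`, and `predict`. It should not be read as the author's exact method.

```python
import numpy as np

def class_mean_weights(X, y):
    """Weight each instance by its closeness to the mean of its class subset."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    weights = np.empty(X.shape[0])
    for c in np.unique(y):
        idx = np.where(y == c)[0]            # the training subset for class c
        mean = X[idx].mean(axis=0)           # mean vector of that subset
        dist = np.linalg.norm(X[idx] - mean, axis=1)  # assumed: Euclidean distance
        weights[idx] = 1.0 / (1.0 + dist)    # assumed: closer to the mean, larger weight
    return weights

def fit_weighted_mnb(X, y, weights, alpha=1.0):
    """Multinomial naive Bayes where every count is scaled by the instance weight."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    classes = np.unique(y)
    log_prior = np.empty(len(classes))
    log_likelihood = np.empty((len(classes), X.shape[1]))
    for k, c in enumerate(classes):
        mask = y == c
        w = weights[mask]
        # Laplace-smoothed weighted class prior
        log_prior[k] = np.log((w.sum() + alpha) / (weights.sum() + alpha * len(classes)))
        # Laplace-smoothed weighted word counts for class c
        counts = (w[:, None] * X[mask]).sum(axis=0) + alpha
        log_likelihood[k] = np.log(counts / counts.sum())
    return classes, log_prior, log_likelihood

def predict(model, X):
    """Return the class with the highest posterior log score for each row of X."""
    classes, log_prior, log_likelihood = model
    scores = np.asarray(X, dtype=float) @ log_likelihood.T + log_prior
    return classes[np.argmax(scores, axis=1)]

# Toy term-count matrix: two documents per class, three vocabulary terms.
X_train = np.array([[3, 0, 1], [2, 0, 0], [0, 2, 3], [0, 3, 1]])
y_train = np.array([0, 0, 1, 1])
w_train = class_mean_weights(X_train, y_train)
model = fit_weighted_mnb(X_train, y_train, w_train)
labels = predict(model, [[2, 0, 1], [0, 2, 2]])
```

With weights of this kind, documents that sit far from their class mean contribute less to the estimated word probabilities, which is one plausible way an instance-weighting scheme could soften the effect of atypical training documents.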

Bibliographic details

Source
IEEE International Conference of Intelligent Robotic and Control Engineering | 2018 | 279p | 5 pages
Author
Yongcheng Wu
Original format: PDF
Chinese Library Classification: TP2-53
Keywords
Training; Text categorization; Mathematical model; Classification algorithms; Prediction algorithms; Standards; Probability
