Building naive bayes document classifier using word clusters based on bootstrap averaging

Abstract

Training a naive Bayes document classifier on word clusters can yield low classification accuracy because the cluster distributions are poorly estimated. To address this problem, we build an ordered word list based on the mutual information between words and their semantic cluster labels, construct a sample set of the same size as the word list through bootstrap sampling, and use the average of the corresponding parameters estimated from the sample set as the final parameters for classifying unknown documents. Experimental results on benchmark document data sets show that the proposed strategy achieves higher classification accuracy than naive Bayes document classifiers trained on word clusters or on words.
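
The abstract gives no implementation details, so the following Python sketch is only one possible reading of the bootstrap-averaging step, not the authors' method. It assumes that the word list is resampled with replacement (keeping its size), that each resample yields Laplace-smoothed estimates of P(cluster | class), and that these estimates are averaged across rounds. The mutual-information ranking is skipped; the word list is taken as given. All function names, the toy documents, the cluster assignment, and the parameters `rounds`, `alpha`, and `seed` are hypothetical.

```python
import math
import random
from collections import Counter


def estimate_params(docs, labels, word_weights, word_to_cluster, clusters, alpha=1.0):
    """Laplace-smoothed P(cluster | class); each word occurrence counts
    word_weights[word] times (its multiplicity in the bootstrap sample)."""
    counts = {y: Counter() for y in set(labels)}
    for tokens, y in zip(docs, labels):
        for w in tokens:
            if w in word_weights:
                counts[y][word_to_cluster[w]] += word_weights[w]
    params = {}
    for y, cnt in counts.items():
        total = sum(cnt.values()) + alpha * len(clusters)
        params[y] = {k: (cnt[k] + alpha) / total for k in clusters}
    return params


def bootstrap_average(docs, labels, word_list, word_to_cluster, rounds=50, seed=0):
    """Average the per-round parameter estimates over bootstrap resamples
    of the word list (assumed interpretation of the abstract)."""
    rng = random.Random(seed)
    clusters = sorted(set(word_to_cluster.values()))
    classes = sorted(set(labels))
    avg = {y: {k: 0.0 for k in clusters} for y in classes}
    for _ in range(rounds):
        # Resample the word list with replacement, keeping its original size.
        sample = Counter(rng.choices(word_list, k=len(word_list)))
        params = estimate_params(docs, labels, sample, word_to_cluster, clusters)
        for y in classes:
            for k in clusters:
                avg[y][k] += params[y][k] / rounds
    return avg


def classify(tokens, avg, priors, word_to_cluster):
    """Pick the class with the highest naive Bayes log-score over clusters."""
    scores = {}
    for y, p in avg.items():
        score = math.log(priors[y])
        for w in tokens:
            if w in word_to_cluster:
                score += math.log(p[word_to_cluster[w]])
        scores[y] = score
    return max(scores, key=scores.get)


# Toy data (hypothetical): two classes, two word clusters.
docs = [["goal", "match", "team"], ["vote", "party", "election"],
        ["team", "goal"], ["party", "vote"]]
labels = ["sport", "politics", "sport", "politics"]
word_to_cluster = {"goal": 0, "match": 0, "team": 0,
                   "vote": 1, "party": 1, "election": 1}
word_list = list(word_to_cluster)
priors = {"sport": 0.5, "politics": 0.5}

avg_params = bootstrap_average(docs, labels, word_list, word_to_cluster)
print(classify(["team", "match"], avg_params, priors, word_to_cluster))
```

Averaging the smoothed estimates rather than the raw counts keeps every averaged probability strictly positive, so the log-scores in `classify` are always defined.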

Bibliographic details

Source: IEEE International Symposium on IT in Medicine Education, 2009, 6 pages
Authors: WANG Yuanzhe, ZHANG Qiang, BAI Liyuan
Original format: PDF
Chinese Library Classification: R-53
Keywords: Bayes methods; document handling; bootstrap averaging; bootstrap sampling; distribution estimation; naive Bayes document classifier; semantic cluster labels; word clusters

Similar literature

1. A Naive Bayes Classifier for Web Document Summaries Created by Using Word Similarity and Significant Factors [J]. Maria Soledad Pera, Yiu-Kai Ng. International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms. 2010, No. 4
2. Applying Naive Bayes Classifier to Document Clustering [J]. Jie Ji, Qiangfu Zhao. Journal of Advanced Computational Intelligence and Intelligent Informatics. 2010, No. 6a78
3. Compression-Based Averaging of Selective Naive Bayes Classifiers [J]. Marc Boullé. Journal of Machine Learning Research. 2007, July issue
4. Building Naive Bayes Document Classifier Using Word Clusters Based on Bootstrap Averaging [C]. WANG Yuanzhe, ZHANG Qiang, BAI Liyuan. Proceedings of the 2009 IEEE International Symposium on IT in Medicine Education. 2009
5. Application of a Hidden Bayes Naive Multiclass Classifier in Network Intrusion Detection [D]. Levent Koc. 2013
6. Prediction of Protein Acetylation Sites using Kernel Naive Bayes Classifier Based on Protein Sequences Profiling [O]. Md. Shakil Ahmed, Md. Shahjaman, Enamul Kabir. 2018
7. Komparasi Akurasi Metode Correlated Naive Bayes Classifier dan Naive Bayes Classifier untuk Diagnosis Penyakit Diabetes (Comparison of the accuracy of the Correlated Naive Bayes Classifier and the Naive Bayes Classifier for diabetes diagnosis) [O]. Hairani Hairani, Gibran Satya Nugraha, Mokhammad Nurkholis Abdillah. 2018
