Research and Realization of Naive Bayes English Text Classification Method Based on Base Noun Phrase Identification

机译：基于基础名词短语识别的朴素贝叶斯英语文本分类方法的研究与实现

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

To more advance classification accuracy of English texts, Naïve Bayes method based on base noun phrase (BaseNP) identification is presented. The rising maximum entropy model is applied to the identification. Firstly, use training corpus and user-defined feature templates to generate candidate features. Secondly, the feature selection algorithm computing feature gains is applied to select features. Finally, at the parameter estimation stage, the improved iterative scaling (IIS) algorithm is adopted. The experimental results show that this technique achieved precision and recall rates of roughly 93% for BaseNP identification and the classification accuracy is remarkably improved on this basis. It indicates that shallow parsing of high accuracy is very helpful to text classification.

机译：为了提高英语文本的分类准确性，提出了一种基于基础名词短语（BaseNP）识别的朴素贝叶斯方法。上升的最大熵模型被应用于识别。首先，使用训练语料库和用户定义的特征模板来生成候选特征。其次，将计算特征增益的特征选择算法应用于特征选择。最后，在参数估计阶段，采用了改进的迭代缩放（IIS）算法。实验结果表明，该技术对BaseNP的识别精度和召回率约为93％，在此基础上，分类精度得到了显着提高。这表明高精度的浅层解析对文本分类非常有帮助。

著录项

来源
《Information and Communications Technology, 2005. Enabling Technologies for the New Knowledge Society: ITI 3rd International Conference on》|2005年|P.805-812|共8页
会议地点
作者
Lin Lv; Yu-Shu Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类工业技术;
关键词
Base Noun Phrase; Maximum Entropy Model; Naïve Bayes; Phrase Identification; Text Classification; Base Noun Phrase; Maximum Entropy Model; Naïve Bayes; Phrase Identification; Text Classification;

机译：基本名词短语;最大熵模型;朴素贝叶斯;短语识别;文本分类;基本名词短语;最大熵模型;朴素贝叶斯;短语识别;文本分类;

相似文献

外文文献
中文文献
专利

1. Integrating associative rule-based classification with Naive Bayes for text classification [J] . Hadi Wael, Al-Radaideh Qasem A., Alhawari Samer Applied Soft Computing . 2018,第期

机译：将基于关联规则的分类与Naive Bayes集成进行文本分类
2. Improved feature size customized fast correlation-based filter for Naive Bayes text classification [J] . Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2020,第3期

机译：改进的特征尺寸自定义基于快速相关的基于快速相关的过滤器，用于Naive Bayes文本分类
3. Towards perfect text classification with Wikipedia-based semantic Naive Bayes learning [J] . Kim Han-joon, Kim Jiyun, Kim Jinseog, Neurocomputing . 2018,第NOVa13期

机译：通过基于维基百科的语义朴素贝叶斯学习实现完美的文本分类
4. Research and Realization of Naive Bayes English Text Classification Method Based on Base Noun Phrase Identification [C] . Lin Lv, Yu-Shu Liu International Conference on Information and Communications Technology . 2005

机译：基于基础名词短语识别的天真贝叶斯英语文本分类方法的研究与实现
5. Naive Bayes and similarity based methods for identifying computer users using keystroke patterns. [D] . Joshi, Shrijit S. 2009

机译：朴素贝叶斯和基于相似度的使用击键模式识别计算机用户的方法。
6. Naive Bayes classifiers for verbal autopsies: comparison to physician-based classification for 21000 child and adult deaths [O] . Pierre Miasnikof, Vasily Giannakeas, Mireille Gomes, 2015

机译：朴素贝叶斯言语尸检分类器：与基于医师的21000名儿童和成人死亡分类比较
7. A new feature selection score for multinomial naive bayes text classification based on kl-divergence [O] . Karl-michael Schneider 2004

机译：基于kl散度的多项式朴素贝叶斯文本分类的新特征选择得分

Research and Realization of Naive Bayes English Text Classification Method Based on Base Noun Phrase Identification

摘要

著录项

相似文献

相关主题

期刊订阅