首页> 外文会议>IEEE International Conference on Recent Advances and Innovations in Engineering >Modifying Naive Bayes classifier for multinomial text classification
【24h】

Modifying Naive Bayes classifier for multinomial text classification

机译:修改朴素贝叶斯分类器以进行多项式文本分类

获取原文

摘要

Data analysis of the textual data floating through social networking sites is being viewed as a promising source of knowledge people - the potential users and consumers of certain web application. The existing text mining techniques need to be correct and fast. Also, statistical classifiers like Naive Bayes are fast and easy to implement but do not perform well for imbalance text datasets or for datasets with highly correlated features. In this paper, a modified model for Naive Bayes classifier for multinomial text classification has been proposed by modifying the conventional bag of words model. The experimental results over benchmark datasets prove its superior performance than original Naive Bayes multinomial model. Feature selection and term weighting is combined with the proposed classifier for studying how well it can be implemented for various text mining applications.
机译:通过社交网站浮动的文本数据的数据分析被视为人们的有希望的知识来源-某些Web应用程序的潜在用户和消费者。现有的文本挖掘技术需要正确且快速。此外,像朴素贝叶斯这样的统计分类器快速,易于实现,但对于不平衡文本数据集或具有高度相关特征的数据集,效果不佳。通过修改传统的词袋模型,提出了一种朴素贝叶斯分类器用于文本分类的改进模型。在基准数据集上的实验结果证明了其性能优于原始的朴素贝叶斯多项式模型。特征选择和术语加权与提出的分类器相结合,用于研究如何针对各种文本挖掘应用程序很好地实现它。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号