首页> 中文期刊> 《计算机工程与设计》 >基于改进权重贝叶斯的维文文本分类模型

基于改进权重贝叶斯的维文文本分类模型

         

摘要

为提高朴素贝叶斯分类器的分类性能,考虑决策分类过程中条件属性的不同重要程度,提出了一种基于特征选择权重的贝叶斯分类算法.采用卡方值和文档频数相结合的数值来表示特征词的重要程度,对该值进行处理获得每个特征词权重,建立加权贝叶斯分类器.在研究维文特点的基础上,利用该算法构建了一个维文文本分类模型.在搜集到的维文语料库上进行的实验结果表明,该算法比朴素贝叶斯拥有更好的分类性能.%To improve the performance of the Naive Bayes classifier, a weighted Bayes method is proposed based on feature selection weight for taking into account different conditions have different effects to the decision conditions. Firstly, the effect value of every feature is computed by the combination of the Chi Square value and document frequency, DF. Then, the weight of every feature is computed by the effect value and weighted Bayes Classifier is built. A Uyghur text classification model is built based on the properties of Uyghur Language and this method. Results of the experiment based on the Uyghur corpus which collected from the internet indicate the metric had better classification performance than Naive Bayesian Classifier.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号