【24h】

Research of Text Categorization on WEKA

机译:Weka文本分类研究

获取原文

摘要

The choice of algorithm is a key text categorization problem. In order to evaluation synthetically, analyzed three popular text categorization algorithm that are naive Bayes (NB), decision tree(DT) and support vector machines(SVM). Carried on simulation experiment used the open source data mining tool of Weka. Experimental results show some significant conclusions: The performance of three classification methods are better, including Support vector machine classification of the best performance, highest precision and recall, naive Bayes second, the minimum Decision tree. Also found that classification performance associated not only the choice of the classification algorithm but also the differences between corpus categories.
机译:算法的选择是一个关键文本分类问题。 为了综合评估,分析了三种流行的文本分类算法,它是朴素贝叶斯(NB),决策树(DT)和支持向量机(SVM)。 进行仿真实验采用了Weka的开源数据采矿工具。 实验结果表明了一些重要结论:三种分类方法的性能更好,包括支持向量机器分类最佳性能,最高精度和召回,幼稚贝叶斯第二,最小决策树。 还发现,分类性能不仅关联分类算法的选择,还关联了语料库类别之间的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号