首页> 外文期刊>International journal of information technology and web engineering >An Experimental Study for the Effect of Stop Words Elimination for Arabic Text Classification Algorithms
【24h】

An Experimental Study for the Effect of Stop Words Elimination for Arabic Text Classification Algorithms

机译:消除废词对阿拉伯文本分类算法影响的实验研究

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, an experimental study was conducted on three techniques for Arabic text classification. These techniques are Support Vector Machine (SVM) with Sequential Minimal Optimization (SMO), Naive Bayesian (NB), and J48. The paper assesses the accuracy for each classifier and determines which classifier is more accurate for Arabic text classification based on stop words elimination. The accuracy for each classifier is measured by Percentage split method (holdout), and K-fold cross validation methods, along with the time needed to classify Arabic text. The results show that the SMO classifier achieves the highest accuracy and the lowest error rate, and shows that the time needed to build the SMO model is much lower compared to other classification techniques.
机译:在本文中,对三种用于阿拉伯文本分类的技术进行了实验研究。这些技术是具有顺序最小优化(SMO)的支持向量机(SVM),朴素贝叶斯(NB)和J48。本文评估了每个分类器的准确性,并基于消除停用词来确定哪个分类器更适合阿拉伯文本分类。每个分类器的准确性通过百分比拆分方法(保持)和K倍交叉验证方法以及对阿拉伯文本进行分类所需的时间来衡量。结果表明,SMO分类器实现了最高的准确性和最低的错误率,并且表明与其他分类技术相比,构建SMO模型所需的时间要短得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号