首页> 外文会议>International Conference on Natural Language Processing >Comparative Study of Stemming Strategy for Hadith Text Classification
【24h】

Comparative Study of Stemming Strategy for Hadith Text Classification

机译:Hadith文本分类中词干提取策略的比较研究

获取原文

摘要

Arabic language is a language has a deep meaning. Text Classification has become trending in this era due to increasing data on the Internet. However, there is only few researchers only work on Arabic language due to the difficulties of Arabic language. In this study, the researcher using Hadith Arabic Text Dataset with different Stemming strategy method. The researcher managed to collect 1253 documents of Hadith sahih from different kitab which are Bukhary, Muslim and Tirmizi. Convolutional Neural Neural with Linear Support Vector Machine were used as classifier in order to investigate the performance of the model. The experiment were conducted with three different Arabic strategy method; light stemming, Khoja Stemming and no stemming method. The non-stemming strategy managed to get higher accuracy compared to other strategy method which is 89.05% of accuracy.
机译:阿拉伯语是一种有着深刻含义的语言。由于互联网上数据的增加,文本分类已成为这个时代的趋势。然而,由于阿拉伯语的困难,只有很少的研究人员专门研究阿拉伯语。在本研究中,研究者使用了不同词干策略的Hadith阿拉伯语文本数据集。研究人员从布哈里、穆斯林和提尔米兹等不同的基塔布收集了1253份圣训文献。采用卷积神经网络和线性支持向量机作为分类器,研究了模型的性能。实验采用三种不同的阿拉伯语策略方法进行;轻填塞、Khoja填塞和无填塞方法。与其他策略方法相比,非阻塞策略获得了更高的准确率,为89.05%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号