首页> 外文期刊>Journal of Applied Computer Science & Mathematics >Arabic Text Categorization Using Improved k-Nearest neighbour Algorithm
【24h】

Arabic Text Categorization Using Improved k-Nearest neighbour Algorithm

机译:使用改进的k最近邻算法的阿拉伯文本分类

获取原文
           

摘要

Abstract–The quantity of text information published inArabic language on the net requires the implementation ofeffective techniques for the extraction and classifying of relevantinformation contained in large corpus of texts. In this paper wepresented an implementation of an enhanced k-NN Arabic textclassifier. We apply the traditional k-NN and Naive Bayes fromWeka Toolkit for comparison purpose. Our proposed modifiedk-NN algorithm features an improved decision rule to skip theclasses that are less similar and identify the right class from knearest neighbours which increases the accuracy. The studyevaluates the improved decision rule technique using thestandard of recall, precision and f-measure as the basis ofcomparison. We concluded that the effectiveness of the proposedclassifier is promising and outperforms the classical k-NNclassifier.
机译:摘要–阿拉伯语在网络上发布的文本信息数量众多,需要实施有效的技术来提取和分类包含在大型文本语料库中的相关信息。在本文中,我们介绍了增强的k-NN阿拉伯文本分类器的实现。我们将传统的k-NN和来自Weka Toolkit的朴素贝叶斯应用于比较。我们提出的改进的k-NN算法具有改进的决策规则,可以跳过不太相似的类并从近邻邻居中识别正确的类,从而提高了准确性。本研究以召回率,精确度和f度量为标准,对改进的决策规则技术进行了评估。我们得出的结论是,提出的分类器的有效性是有希望的,并且优于经典的k-NN分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号