首页> 外文会议>ACS International Conference on Computer Systems and Applications >A hybrid BSO-Chi2-SVM approach to Arabic text categorization
【24h】

A hybrid BSO-Chi2-SVM approach to Arabic text categorization

机译:阿拉伯文文本分类的混合BSO-CHI2-SVM方法

获取原文

摘要

Automatic categorization of documents has become an important task, especially with the rapid growth of the number of documents available online. Automatic categorization of documents consists in assigning a category to a text based on the information it contains. It aims to automate the association of a document with a category. Automatic categorization can allow solving several problems such as identifying the language of a document, the filtering and detection of spam (junk mail), the routing and forwarding of emails to their recipients, etc. In this paper, we present the results of Arabic text categorization based on three different approaches: artificial neural networks, support vector machines (SVMs) and a hybrid approach BSO-CHI-SVM. We explain the approach and present the results of the implementation and evaluation using two types of representations: root-based stemming and light stemming. The evaluation in each case was done on the Open Source Arabic Corpora (OSAC) using different performance measures.
机译:自动分类文件已成为一项重要任务,特别是随着在线提供的文件数量的快速增长。自动分类文档在于根据其包含的信息分配给文本的类别。它旨在使文档与类别的关联自动化。自动分类可以解决若干问题,例如识别文档的语言,垃圾邮件(垃圾邮件)的过滤和检测,对其收件人的电子邮件的路由和转发。在本文中,我们介绍了阿拉伯文的结果基于三种不同方法的分类:人工神经网络,支持向量机(SVM)和混合方法BSO-CHI-SVM。我们解释了这种方法,并使用两种类型的陈述展示了实施和评估结果:基于根本的终止和光线。使用不同的性能措施,每种情况的评估是在开源阿拉伯语料库(OSAC)上完成的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号