首页> 外文会议>2013 ACS International Conference on Computer Systems and Applications >A hybrid BSO-Chi2-SVM approach to Arabic text categorization
【24h】

A hybrid BSO-Chi2-SVM approach to Arabic text categorization

机译:混合BSO-Chi2-SVM方法进行阿拉伯文本分类

获取原文
获取原文并翻译 | 示例

摘要

Automatic categorization of documents has become an important task, especially with the rapid growth of the number of documents available online. Automatic categorization of documents consists in assigning a category to a text based on the information it contains. It aims to automate the association of a document with a category. Automatic categorization can allow solving several problems such as identifying the language of a document, the filtering and detection of spam (junk mail), the routing and forwarding of emails to their recipients, etc. In this paper, we present the results of Arabic text categorization based on three different approaches: artificial neural networks, support vector machines (SVMs) and a hybrid approach BSO-CHI-SVM. We explain the approach and present the results of the implementation and evaluation using two types of representations: root-based stemming and light stemming. The evaluation in each case was done on the Open Source Arabic Corpora (OSAC) using different performance measures.
机译:文档的自动分类已成为一项重要任务,尤其是随着在线可用文档数量的快速增长。文档的自动分类包括根据文本包含的信息为文本分配一个类别。它旨在使文档与类别的关联自动化。自动分类可以解决一些问题,例如识别文档的语言,过滤和检测垃圾邮件(垃圾邮件),将电子邮件路由和转发给收件人等。在本文中,我们介绍了阿拉伯文本的结果基于三种不同方法的分类:人工神经网络,支持向量机(SVM)和混合方法BSO-CHI-SVM。我们解释了该方法,并使用两种表示形式介绍了实现和评估的结果:基于根的词根和轻度词根。每种情况下的评估都是使用不同的绩效指标在开源阿拉伯语料库(OSAC)上进行的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号