首页> 外文期刊>ACM transactions on Asian language information processing >Robust Arabic Text Categorization by Combining Convolutional and Recurrent Neural Networks
【24h】

Robust Arabic Text Categorization by Combining Convolutional and Recurrent Neural Networks

机译:通过组合卷积和经常性神经网络来强大的阿拉伯语文本分类

获取原文
获取原文并翻译 | 示例
       

摘要

Text Categorization is an important task in the area of Natural Language Processing (NLP). Its goal is to learn a model that can accurately classify any textual document for a given language into one of a set of predefined categories. In the context of the Arabic language, several approaches have been proposed to tackle this problem, many of which are based on the bag-of-words assumption. Even though these methods usually produce good results for the classification task, they often fail to capture contextual dependencies from textual data. On the other hand, deep learning architectures that are usually based on Recurrent Neural Networks (RNNs) or Convolutional Neural Networks (CNNs) do not suffer from such a limitation and have recently shown very promising results in various NLP applications. In this work, we use deep learning models that combine RNN and CNN for the task of Arabic text categorization using static, dynamic, and fine-tuned word embeddings. The experimental results reported on the Open Source Arabic Corpora (OSAC) dataset have shown the effectiveness and high performance of our proposed models.
机译:文本分类是自然语言处理区域(NLP)中的一个重要任务。其目标是学习一个模型,可以将给定语言的任何文本文档分类为一组预定义类别。在阿拉伯语语言的背景下,已经提出了几种方法来解决这个问题,其中许多是基于单词的假设。尽管这些方法通常对分类任务产生良好的结果,但它们通常无法从文本数据中捕获上下文依赖关系。另一方面,通常基于经常性神经网络(RNN)或卷积神经网络(CNNS)的深度学习架构不会遭受这种限制,并且最近在各种NLP应用中显示了非常有前途的结果。在这项工作中,我们使用使用静态,动态和微调单词嵌入的阿拉伯文分类的任务组合RNN和CNN的深度学习模型。在开源阿拉伯语学数(OSAC)数据集上报告的实验结果表明了我们所提出的模型的有效性和高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号