Robust Arabic Text Categorization by Combining Convolutional and Recurrent Neural Networks

Ameur Mohamed Seghir Hadj; Belkebir Riadh; Guessoum Ahmed


Abstract

Text Categorization is an important task in the area of Natural Language Processing (NLP). Its goal is to learn a model that can accurately classify any textual document for a given language into one of a set of predefined categories. In the context of the Arabic language, several approaches have been proposed to tackle this problem, many of which are based on the bag-of-words assumption. Even though these methods usually produce good results for the classification task, they often fail to capture contextual dependencies from textual data. On the other hand, deep learning architectures that are usually based on Recurrent Neural Networks (RNNs) or Convolutional Neural Networks (CNNs) do not suffer from such a limitation and have recently shown very promising results in various NLP applications. In this work, we use deep learning models that combine RNN and CNN for the task of Arabic text categorization using static, dynamic, and fine-tuned word embeddings. The experimental results reported on the Open Source Arabic Corpora (OSAC) dataset have shown the effectiveness and high performance of our proposed models.
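The abstract does not spell out how the convolutional and recurrent parts are wired together, so the following is only a minimal, untrained forward-pass sketch of one plausible arrangement: word embeddings, then a 1-D convolution over sliding windows of tokens, then a simple recurrent layer whose final state feeds a softmax over categories. Every name here (`ConvRecurrentClassifier`, `conv1d_relu`, `simple_rnn`, the layer sizes) is hypothetical, and the reading of "static" as frozen pretrained vectors, "fine-tuned" as pretrained vectors that are updated, and "dynamic" as randomly initialized trainable vectors is an assumption rather than the authors' stated definition.

```python
import math
import random


def init_matrix(rows, cols, rng, scale=0.1):
    """Small random weights; stands in for trained parameters."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def conv1d_relu(sequence, filters, width):
    """Apply each filter to every window of `width` consecutive embeddings."""
    outputs = []
    for t in range(len(sequence) - width + 1):
        window = [x for vec in sequence[t:t + width] for x in vec]
        outputs.append([max(0.0, s) for s in matvec(filters, window)])
    return outputs


def simple_rnn(sequence, w_in, w_rec):
    """Plain tanh recurrence; returns the final hidden state."""
    hidden = [0.0] * len(w_rec)
    for x in sequence:
        a, b = matvec(w_in, x), matvec(w_rec, hidden)
        hidden = [math.tanh(ai + bi) for ai, bi in zip(a, b)]
    return hidden


def softmax(scores):
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class ConvRecurrentClassifier:
    """Hypothetical CNN-then-RNN text classifier (forward pass only)."""

    def __init__(self, vocab_size, emb_dim, n_filters, width, hidden, n_classes,
                 pretrained=None, trainable=True, seed=0):
        rng = random.Random(seed)
        # Assumed embedding modes (not confirmed by the source):
        #   static     -> pretrained given, trainable=False
        #   fine-tuned -> pretrained given, trainable=True
        #   dynamic    -> pretrained=None (random init), trainable=True
        self.embeddings = (pretrained if pretrained is not None
                           else init_matrix(vocab_size, emb_dim, rng))
        self.trainable_embeddings = trainable  # only recorded; no training here
        self.width = width
        self.filters = init_matrix(n_filters, width * emb_dim, rng)
        self.w_in = init_matrix(hidden, n_filters, rng)
        self.w_rec = init_matrix(hidden, hidden, rng)
        self.w_out = init_matrix(n_classes, hidden, rng)

    def predict_proba(self, token_ids):
        embedded = [self.embeddings[i] for i in token_ids]
        features = conv1d_relu(embedded, self.filters, self.width)
        state = simple_rnn(features, self.w_in, self.w_rec)
        return softmax(matvec(self.w_out, state))


# Usage: six made-up token ids, four made-up categories.
model = ConvRecurrentClassifier(vocab_size=50, emb_dim=8, n_filters=6,
                                width=3, hidden=5, n_classes=4)
probs = model.predict_proba([3, 17, 42, 8, 25, 1])
```

The convolution sees local word windows while the recurrence carries information across window positions, which is the kind of contextual dependency a bag-of-words representation discards. A practical model would be trained with a deep learning framework; the weights here are random, so `probs` carries no meaning beyond being a valid distribution over the categories.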


Bibliographic record

Source
ACM transactions on Asian language information processing | 2020, Issue 5 | pp. 66.1-66.16 | 16 pages
Authors
Ameur Mohamed Seghir Hadj; Belkebir Riadh; Guessoum Ahmed
Author affiliations

Univ Sci & Technol Houari Boumediene USTHB TALAA NLP ML & Applicat Res Grp Lab Res AI LRIA Algiers 16111 Algeria (all three authors)

Indexing information
Original format: PDF
Language of text: English
Keywords
Natural language processing; Arabic language; Arabic text categorization; Arabic text classification; deep learning; convolutional neural networks; recurrent neural networks; pretrained word embeddings;


Similar documents

1. Robust Recognition of Chinese Text from Cellphone-acquired Low-quality Identity Card Images Using Convolutional Recurrent Neural Network [J]. Jianmei Wang, Ruize Wu, Shaoming Zhang. Sensors and materials. 2021, Issue 4.
2. A Novel Text Representation Model to Categorize Text Documents using Convolution Neural Network [J]. M. B. Revanasiddappa, B. S. Harish. International Journal of Intelligent Systems and Applications. 2019, Issue 5.
3. Real-time Arabic scene text detection using fully convolutional neural networks [J]. Rajae Moumen, Raddouane Chiheb, Rdouan Faizi. International Journal of Electrical and Computer Engineering. 2021, Issue 2.
4. Semantic Meaning Based Bengali Web Text Categorization Using Deep Convolutional and Recurrent Neural Networks (DCRNNs) [C]. Md. Rajib Hossain, Mohammed Moshiul Hoque. International Conference on Internet of Things and Connected Technologies. 2021.
5. Deep Neural Language Model for Text Classification Based on Convolutional and Recurrent Neural Networks [D]. Hassan, Abdalraouf. 2018.
6. Semi-supervised Convolutional Neural Networks for Text Categorization via Region Embedding [O]. Rie Johnson, Tong Zhang.
7. Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification [O]. Imon Banerjee, Yuan Ling, Matthew C. Chen, 2019.
