Automated Amharic News Categorization Using Deep Learning Models

Demeke Endalie; Getamesay Haile

首页> 外文期刊>Computational intelligence and neuroscience >Automated Amharic News Categorization Using Deep Learning Models

【24h】

Automated Amharic News Categorization Using Deep Learning Models

机译：使用深度学习模型自动进行阿姆哈拉语新闻分类

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相关主题

摘要

For decades, machine learning techniques have been used to process Amharic texts. The potential application of deep learning on Amharic document classification has not been exploited due to a lack of language resources. In this paper, we present a deep learning model for Amharic news document classification. The proposed model uses fastText to generate text vectors to represent semantic meaning of texts and solve the problem of traditional methods. The text vectors matrix is then fed into the embedding layer of a convolutional neural network (CNN), which automatically extracts features. We conduct experiments on a data set with six news categories, and our approach produced a classification accuracy of 93.79. We compared our method to well-known machine learning algorithms such as support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), XGBoost (XGB), and random forest (RF) and achieved good results.

机译：几十年来，机器学习技术一直被用于处理阿姆哈拉语文本。由于缺乏语言资源，深度学习在阿姆哈拉语文档分类中的潜在应用尚未得到开发。在本文中，我们提出了一种用于阿姆哈拉语新闻文档分类的深度学习模型。该模型使用fastText生成文本向量来表示文本的语义意义，解决了传统方法的问题。然后将文本向量矩阵输入卷积神经网络（CNN）的嵌入层，该网络会自动提取特征。我们对具有六个新闻类别的数据集进行了实验，我们的方法产生了 93.79% 的分类准确率。我们将我们的方法与支持向量机（SVM）、多层感知器（MLP）、决策树（DT）、XGBoost（XGB）和随机森林（RF）等知名机器学习算法进行了比较，并取得了良好的效果。

著录项

来源
《Computational intelligence and neuroscience》 |2021年第4期|ArticleID3774607-ArticleID3774607|共1页
作者
Demeke Endalie; Getamesay Haile;
展开▼
作者单位

Faculty of Computing and Informatics, Jimma Institute of Technology, Jimma, Ethiopia;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类寄生生物学;
关键词

Automated Amharic News Categorization Using Deep Learning Models

摘要

著录项

引文网络

相关主题

期刊订阅