首页> 外文期刊>Computational intelligence and neuroscience >Automated Amharic News Categorization Using Deep Learning Models
【24h】

Automated Amharic News Categorization Using Deep Learning Models

机译:使用深度学习模型自动进行阿姆哈拉语新闻分类

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

For decades, machine learning techniques have been used to process Amharic texts. The potential application of deep learning on Amharic document classification has not been exploited due to a lack of language resources. In this paper, we present a deep learning model for Amharic news document classification. The proposed model uses fastText to generate text vectors to represent semantic meaning of texts and solve the problem of traditional methods. The text vectors matrix is then fed into the embedding layer of a convolutional neural network (CNN), which automatically extracts features. We conduct experiments on a data set with six news categories, and our approach produced a classification accuracy of 93.79. We compared our method to well-known machine learning algorithms such as support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), XGBoost (XGB), and random forest (RF) and achieved good results.
机译:几十年来,机器学习技术一直被用于处理阿姆哈拉语文本。由于缺乏语言资源,深度学习在阿姆哈拉语文档分类中的潜在应用尚未得到开发。在本文中,我们提出了一种用于阿姆哈拉语新闻文档分类的深度学习模型。该模型使用fastText生成文本向量来表示文本的语义意义,解决了传统方法的问题。然后将文本向量矩阵输入卷积神经网络 (CNN) 的嵌入层,该网络会自动提取特征。我们对具有六个新闻类别的数据集进行了实验,我们的方法产生了 93.79% 的分类准确率。我们将我们的方法与支持向量机(SVM)、多层感知器(MLP)、决策树(DT)、XGBoost(XGB)和随机森林(RF)等知名机器学习算法进行了比较,并取得了良好的效果。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号