首页> 外文会议>International Conference on Information and Communication Systems >Team Wa’ed Al-Shrida at the Mowjaz Multi-Topic Labelling Task
【24h】

Team Wa’ed Al-Shrida at the Mowjaz Multi-Topic Labelling Task

机译:Wa'ed Al-Shrida在Mowjaz多主题标签任务

获取原文
获取外文期刊封面目录资料

摘要

This paper describes an attempt in the "Mowjaz Multi-topic Labelling Task", the competition is about classifying the Arabic articles to its topic by using artificial intelligence and neural networks, the programming language that was used to classify the datasets is "Python". The attempt was started by uploading the datasets from the "Github website", the datasets that were used in the system include three groups, train, validation, and test datasets. The "Pyarabic" and simple-transformers" libraries were used to allow the system to manipulate Arabic letters and simplify the usage of Transformer models without having to compromise on utility, respectively. The model’s type that I used is "Bert" and its name is "Asafaya/Bert-base-Arabic". The accuracy of the result that was gotten is as follows: F1 macro: 0.864, F1 micro: 0.869, competition website on Codalab: 0.8430.
机译:本文介绍了在“Mowjaz多主题标签任务”中的尝试,竞争是通过使用人工智能和神经网络对其主题进行分类,用于对数据集进行分类的编程语言是“Python”的“Python”。 该尝试是通过从“github网站”上传数据集,系统中使用的数据集包括三组,列车,验证和测试数据集。 “Pyarabic”和简单变换器“库允许系统操纵阿拉伯语字母,并简化了变压器模型的使用,而不必分别妥协。模型的我使用的类型是”bert“,它的名称是”bert“ “Asafaya / Bert-Base-Arabic”。所得到的结果的准确性如下:F1宏:0.864,F1 Micro:0.869,Codalab上的竞争网站:0.8430。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号