AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi Tweets

Nora Al-Twairesh; Hend Al-Khalifa; AbdulMalik Al-Salman; Yousef Al-Ohali

首页> 外文期刊>Procedia Computer Science >AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi Tweets

【24h】

AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi Tweets

机译：AraSenTi-Tweet：沙特推文阿拉伯语情感分析的语料库

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Arabic Sentiment Analysis is an active research area these days. However, the Arabic language still lacks sufficient language resources to enable the tasks of sentiment analysis. In this paper, we present the details of collecting and constructing a large dataset of Arabic tweets. The techniques used in cleaning and pre-processing the collected dataset are explained. A corpus of Arabic tweets annotated for sentiment analysis was extracted from this dataset. The corpus consists mainly of tweets written in Modern Standard Arabic and the Saudi dialect. The corpus was manually annotated for sentiment. The annotation process is explained in detail and the challenges during the annotation are highlighted. The corpus contains 17,573 tweets labelled with four labels for sentiment: positive, negative, neutral and mixed. Baseline experiments were conducted to provide benchmark results for future work.

机译：如今，阿拉伯语情绪分析是一个活跃的研究领域。但是，阿拉伯语仍然缺乏足够的语言资源来执行情感分析任务。在本文中，我们介绍了收集和构建大型阿拉伯语推文数据集的详细信息。清理和预处理收集的数据集的技术进行了说明。从该数据集中提取了注释为情感分析的阿拉伯语推文语料库。语料库主要包含以现代标准阿拉伯语和沙特方言编写的推文。对该语料库进行了手动注释以表达情感。详细说明了注释过程，并突出了注释过程中的挑战。语料库包含17,573条推文，标有四个情感标签：正面，负面，中立和混合。进行了基线实验，以为将来的工作提供基准结果。

著录项

来源
《Procedia Computer Science》 |2017年第1期|共10页
作者
Nora Al-Twairesh; Hend Al-Khalifa; AbdulMalik Al-Salman; Yousef Al-Ohali;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
入库时间 2022-08-18 20:06:48

相似文献

外文文献
中文文献
专利

1. Cyberbullying Detection by Sentiment Analysis of Tweets' Contents Written in Arabic in Saudi Arabia Society [J] . Amjad Rasmi Almutairi, Muhammad Abdullah Al-Hagery International journal of computer science and network security . 2021,第3期

机译：沙特阿拉伯社会阿拉伯语中写道的推文内容的情感分析
2. AraCust: a Saudi Telecom Tweets corpus for sentiment analysis [J] . Latifah Almuqren, Alexandra Cristea PeerJ Computer Science . 2021,第a期

机译：Aracust：沙特电信推文语料库的情绪分析
3. Sentiment lexicon for sentiment analysis of Saudi dialect tweets [J] . Abdulmohsen Al-Thubaity, Qubayl Alqahtani, Abdulaziz Aljandal Procedia Computer Science . 2018,第1期

机译：用于沙特方言推文情感分析的情感词典
4. Sentiment Analysis of Arabic Tweets in Smart Cities: A Review of Saudi Dialect [C] . Shoayee Alotaibi, Rashid Mehmood, Iyad Katib International Conference on Fog and Mobile Edge Computing . 2019

机译：智慧城市中阿拉伯语推文的情感分析：沙特方言评论
5. Deep Neural Networks for Sentiment Analysis in Tweets with Emoticons [D] . Narayanaperumal, Mutharasu. 2020

机译：与表情符号的推文情感分析的深神经网络
6. A deep neural network approach for sentiment analysis of medically related texts: an analysis of tweets related to concussions in sports [O] . Kayvan Tirdad, Alex Dela Cruz, Alireza Sadeghian, 2021

机译：医学相关文本情感分析的深度神经网络方法：对运动脑脑脑震荡的推文分析
7. Developing Lexicon-based Algorithms and Sentiment Lexicon for Sentiment Analysis of Saudi Dialect Tweets [O] . Waleed Al-Ghaith 2019

机译：开发基于词汇的算法和情感词典，用于沙特语方言推文的情感分析

AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi Tweets

摘要

著录项

相似文献

相关主题

期刊订阅