A Semi-supervised Corpus Annotation for Saudi Sentiment Analysis Using Twitter

机译：使用Twitter的半监督语料库注释，用于沙特阿拉伯情绪分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the literature, limited work has been conducted to develop sentiment resources for Saudi dialect. The lack of resources such as dialectical lexicons and corpora are some of the major bottlenecks to the successful development of Arabic sentiment analysis models. In this paper, a semi-supervised approach is presented to construct an annotated sentiment corpus for Saudi dialect using Twitter. The presented approach is primarily based on a list of lexicons built by using word embedding techniques such as word2vec. A huge corpus extracted from twitter is annotated and manually reviewed to exclude incorrect annotated tweets which is publicly available. For corpus validation, state-of-the-art classification algorithms (such as Logistic Regression, Support Vector Machine, and Naive Bayes) are applied and evaluated. Simulation results demonstrate that the Naive Bayes algorithm outperformed all other approaches and achieved accuracy up to 91%.

机译：在文献中，为沙特方言开发情感资源的工作很少。诸如辩证词典和语料库之类的资源不足是成功开发阿拉伯语情感分析模型的主要瓶颈。在本文中，提出了一种半监督方法，使用Twitter为沙特方言构建带注释的情感语料库。提出的方法主要基于通过使用词嵌入技术（例如word2vec）构建的词典列表。对从twitter提取的巨大语料进行批注并进行手动检查，以排除公开可用的不正确批注的tweet。对于语料库验证，应用和评估了最新的分类算法（例如Logistic回归，支持向量机和朴素贝叶斯）。仿真结果表明，朴素贝叶斯算法优于其他所有方法，其准确率高达91％。

著录项

来源
《International conference on brain-inspired cognitive systems》|2018年|589-596|共8页
会议地点
作者
Abdulrahman Alqarafi; Ahsan Adeel; Ahmed Hawalah; Kevin Swingler; Amir Hussain;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Sentiment analysis; Saudi dialect; Word embedding;

机译：情绪分析;沙特方言;词嵌入;

相似文献

外文文献
中文文献
专利

1. A Twitter Sentiment Analysis Model for Measuring Security and Educational Challenges: A Case Study in Saudi Arabia [J] . Hassan Abdullah Alqarni, Yahya AlMurtadha, Abdelrahman Osman Elfaki Journal of computer sciences . 2018,第3期

机译：用于衡量安全和教育挑战的Twitter情感分析模型：以沙特阿拉伯为例
2. A Twitter Sentiment Analysis Model for Measuring Security and Educational Challenges: A Case Study in Saudi Arabia [J] . Alqarni Hassan Abdullah, AlMurtadha Yahya, Elfaki Abdelrahman Osman Journal of computer sciences . 2018,第3期

机译：用于衡量安全和教育挑战的Twitter情绪分析模型：以沙特阿拉伯为例
3. AraCust: a Saudi Telecom Tweets corpus for sentiment analysis [J] . Latifah Almuqren, Alexandra Cristea PeerJ Computer Science . 2021,第a期

机译：Aracust：沙特电信推文语料库的情绪分析
4. A Saudi Dialect Twitter Corpus for Sentiment and Emotion Analysis [C] . Abdulmohsen Al-Thubaity, Mohammed Alharbi, Saif Alqahtani, Saudi Computer Society National Computer Conference . 2018

机译：沙特方言Twitter语料库，用于情感和情感分析
5. Saudis in the Eyes of the Other: A Corpus-Driven Critical Discourse Study of the Representation of Saudis on Twitter [D] . Alanazi, Faizah Mohammed. 2020

机译：在另一个眼中的沙特人：一个语料库驱动的批判性话语研究，对Twitter上的沙特人表示
6. An Effective BERT-Based Pipeline for Twitter Sentiment Analysis: A Case Study in Italian [O] . Marco Pota, Mirko Ventura, Rosario Catelli, 2021

机译：一种有效的基于伯特语的管道用于Twitter情绪分析 - 以意大利语为例
7. Driving Change on Twitter: A Corpus-Assisted Discourse Analysis of the Twitter Debates on the Saudi Ban on Women Driving [O] . Lama Altoaimy 2018

机译：推特驾驶变更：关于沙特语禁令的Twitter辩论的语料库辅助话语分析

A Semi-supervised Corpus Annotation for Saudi Sentiment Analysis Using Twitter

摘要

著录项

相似文献

相关主题

期刊订阅