Sentiment Analysis of Persian-English Code-mixed Texts

Abstract

The rapid production of data on the internet, and the need to understand how users feel from both business and research perspectives, has prompted the creation of numerous automatic monolingual sentiment detection systems. More recently, however, the unstructured nature of social media data has led to more instances of multilingual and code-mixed text, creating new demand for code-mixed sentiment analysis systems. In this study we collect and label Persian-English code-mixed tweets to create a dataset. We then introduce a model that uses pretrained BERT embeddings as well as translation models to automatically learn the polarity scores of these tweets. Our model outperforms baseline models that use Naïve Bayes and Random Forest methods.
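
For orientation only, the kind of Naïve Bayes baseline the abstract mentions could be sketched as follows. This is a minimal illustration and not the authors' implementation: the whitespace tokenizer, the add-one smoothing, the class name NaiveBayesSentiment, and the four training sentences are all assumptions invented here, and none of the examples come from the paper's dataset.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: plain lowercase whitespace tokenization, which leaves
    # Persian-script and Latin-script tokens side by side.
    return text.lower().split()


class NaiveBayesSentiment:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # token counts per class
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            # Log prior plus smoothed log likelihood of each known token.
            score = math.log(n_docs / total_docs)
            n_words = sum(self.word_counts[label].values())
            for token in tokenize(text):
                if token not in self.vocab:
                    continue  # tokens never seen in training are skipped
                count = self.word_counts[label][token]
                score += math.log((count + 1) / (n_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy examples written for this sketch (not real tweets).
train_texts = [
    "این فیلم خیلی awesome بود",
    "عاشق این song شدم so good",
    "امروز واقعا boring بود",
    "این service خیلی bad بود",
]
train_labels = ["positive", "positive", "negative", "negative"]

model = NaiveBayesSentiment().fit(train_texts, train_labels)
prediction = model.predict("این song خیلی good بود")
```

Because the tokenizer treats Persian and English tokens identically, a bag-of-words baseline like this needs no language identification step; the trade-off is that it ignores word order and any meaning shared across the two languages, which is the gap that pretrained embeddings and translation are meant to address.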

Bibliographic details

Source
International Computer Conference, Computer Society of Iran | 2021 | pp. 1-4 | 4 pages
Authors
Nazanin Sabri; Ali Edalat; Behnam Bahrak
Original format
PDF
Keywords
Sentiment analysis; Analytical models; Social networking (online); Computational modeling; Bit error rate; Blogs; Production

Similar literature

1. Deep Learning Based Sentiment Analysis in a Code-Mixed English-Hindi and English-Bengali Social Media Corpus [J]. Jamatia Anupam, Swamy Steve Durairaj, Gamback Bjorn. International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms. 2020, No. 5

2. Annotated corpus creation for sentiment analysis in code-mixed Hindi-English (Hinglish) social network data [J]. Neha Garg, Kamlesh Sharma. Indian Journal of Science and Technology. 2020, No. 40

3. An effective cybernated word embedding system for analysis and language identification in code-mixed social media text [J]. Shekhar Shashi, Sharma Dilip Kumar, Sufyan Beg M.M. International Journal of Knowledge-Based and Intelligent Engineering Systems. 2019, No. 3

4. Corpus Creation for Sentiment Analysis in Code-Mixed Tamil-English Text [C]. Bharathi Raja Chakravarthi, Vigneshwaran Muralidaran, Ruba Priyadharshini. Joint Spoken Language Technologies for Under-resourced Languages and Collaboration and Computing for Under-Resourced Languages Workshop. 2020

5. Fuzzification of Supervised and Semi-Supervised Convolution Neural Networks for Identification of Neutral Text in Sentiment Analysis [D]. Najar, Rawan. 2020

6. A deep neural network approach for sentiment analysis of medically related texts: an analysis of tweets related to concussions in sports [O]. Kayvan Tirdad, Alex Dela Cruz, Alireza Sadeghian. 2021

7. De-Mixing Sentiment from Code-Mixed Text [O]. Yash Kumar Lal, Vaibhav Kumar, Mrinal Dhar. 2019
