Sentiment Analysis of Sinhala News Comments

Ranathunga Surangika; Liyanage Isuru Udara


Abstract

Sinhala is a low-resource language, for which basic language and linguistic tools have not been properly defined. This affects the development of NLP-based end-user applications for Sinhala. Thus, when implementing NLP tools such as sentiment analyzers, we have to rely only on language-independent techniques. This article presents the use of such language-independent techniques in implementing a sentiment analysis system for Sinhala news comments. We demonstrate that for low-resource languages such as Sinhala, the use of recently introduced word embedding models as semantic features can compensate for the lack of well-developed language-specific linguistic or language resources, and text classification with acceptable accuracy is indeed possible using both traditional statistical classifiers and Deep Learning models. The developed classification models, a corpus of 8.9 million tokens extracted from Sinhala news articles and user comments, and Sinhala Word2Vec and fastText word embedding models are now available for public use; 9,048 news comments annotated with POSITIVE/NEGATIVE/NEUTRAL polarities have also been released.
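The idea of using word embeddings as language-independent semantic features for a traditional classifier can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the article: the toy two-dimensional vectors stand in for vectors from a pretrained Word2Vec or fastText model, the English placeholder tokens stand in for tokenized Sinhala comments, and the nearest-centroid classifier is a simple stand-in for whichever statistical classifiers the authors evaluated.

```python
from collections import defaultdict
from math import sqrt

# Hypothetical toy embeddings; a real system would load vectors from a
# pretrained Word2Vec or fastText model instead.
EMBEDDINGS = {
    "good": (1.0, 0.0),
    "great": (0.9, 0.1),
    "bad": (-1.0, 0.0),
    "awful": (-0.9, 0.1),
    "report": (0.0, 1.0),
    "today": (0.0, 0.9),
}


def comment_vector(tokens):
    """Average the embeddings of known tokens; unknown tokens are skipped."""
    vecs = [EMBEDDINGS[t] for t in tokens if t in EMBEDDINGS]
    if not vecs:
        return (0.0, 0.0)
    return tuple(sum(dim) / len(vecs) for dim in zip(*vecs))


def train_centroids(labelled_comments):
    """Compute one mean feature vector (centroid) per polarity label."""
    grouped = defaultdict(list)
    for tokens, label in labelled_comments:
        grouped[label].append(comment_vector(tokens))
    return {
        label: tuple(sum(dim) / len(vecs) for dim in zip(*vecs))
        for label, vecs in grouped.items()
    }


def predict(centroids, tokens):
    """Return the label whose centroid is closest in Euclidean distance."""
    vec = comment_vector(tokens)

    def dist(centroid):
        return sqrt(sum((a - b) ** 2 for a, b in zip(vec, centroid)))

    return min(centroids, key=lambda label: dist(centroids[label]))


# Hypothetical labelled comments (already tokenized).
TRAIN = [
    (["good", "report"], "POSITIVE"),
    (["great"], "POSITIVE"),
    (["bad", "report"], "NEGATIVE"),
    (["awful"], "NEGATIVE"),
    (["report", "today"], "NEUTRAL"),
]

centroids = train_centroids(TRAIN)
print(predict(centroids, ["great", "today"]))
print(predict(centroids, ["awful", "today"]))
```

Averaging token vectors needs no language-specific tooling beyond tokenization, which is why this style of feature suits a low-resource setting; the classifier on top can be swapped for any model that accepts fixed-length vectors.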


Bibliographic record

Source
ACM Transactions on Asian and Low-Resource Language Information Processing | 2021, Issue 4 | pp. 59.1-59.23 | 23 pages
Authors
Ranathunga Surangika; Liyanage Isuru Udara
Author affiliations

Univ Moratuwa Dept Comp Sci & Engn Katubedda 10400 Sri Lanka;

Univ Moratuwa Dept Comp Sci & Engn Katubedda 10400 Sri Lanka;

Original format: PDF
Language of text: English
Keywords
Sinhala; news comments; sentiment analysis; text classification


