Polarity classification for Spanish tweets using the COST corpus

Eugenio Martinez-Camara; M. Teresa Martin-Valdivia; L. Alfonso Urena-Lopez; Ruslan Mitkov

Abstract

It was not until 2010 that businesses, politicians and people in general began to realize the potential of Twitter in Spain. This has awakened research interest in extracting knowledge from Twitter. This paper aims to address the lack of resources for Twitter sentiment analysis in Spanish by studying different features and machine learning algorithms for classifying the polarity of Twitter posts. The result is a new corpus of Spanish tweets called COST, on which we carried out a wide-ranging experiment using different machine learning algorithms. Furthermore, we tested the influence of different weighting schemes for unigrams, of eliminating stop-words and of applying a stemmer.
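The feature variations mentioned in the abstract (unigram weighting schemes and stop-word removal) can be illustrated with a minimal sketch. The abstract does not name the schemes, so the three used here (binary presence, raw term frequency, TF-IDF) are assumptions, as are the toy stop-word list, the example tweets and the function names. Stemming is left out because it would need an external stemmer.

```python
import math
from collections import Counter

# Tiny illustrative stop-word list (an assumption, not the list used in the paper).
STOP_WORDS = {"el", "la", "de", "que", "y", "es", "un", "una", "me"}


def tokenize(text, remove_stop_words=False):
    """Lowercase, split on whitespace, optionally drop stop-words."""
    tokens = text.lower().split()
    if remove_stop_words:
        tokens = [t for t in tokens if t not in STOP_WORDS]
    return tokens


def weight_unigrams(docs, scheme="tfidf", remove_stop_words=False):
    """Return one {unigram: weight} dict per document.

    scheme: "binary" (presence), "tf" (raw count) or
    "tfidf" (count * log(N / document frequency)).
    """
    tokenized = [tokenize(d, remove_stop_words) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each unigram.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        if scheme == "binary":
            vec = {t: 1.0 for t in counts}
        elif scheme == "tf":
            vec = {t: float(c) for t, c in counts.items()}
        elif scheme == "tfidf":
            vec = {t: c * math.log(n_docs / df[t]) for t, c in counts.items()}
        else:
            raise ValueError(f"unknown scheme: {scheme}")
        vectors.append(vec)
    return vectors


# Hypothetical example tweets, not taken from the COST corpus.
tweets = ["me encanta la playa", "la playa es horrible", "me encanta el cine"]
binary = weight_unigrams(tweets, scheme="binary", remove_stop_words=True)
tfidf = weight_unigrams(tweets, scheme="tfidf", remove_stop_words=True)
```

The resulting sparse dictionaries could then be fed to any classifier. Comparing `binary` with `tfidf` on the same tweets shows how the weighting scheme changes the relative importance of a rare word such as "horrible" against a common one such as "playa".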

Bibliographic details

Source
Journal of Information Science, 2015, Issue 3, pp. 263-272 (10 pages)
Authors
Eugenio Martinez-Camara; M. Teresa Martin-Valdivia; L. Alfonso Urena-Lopez; Ruslan Mitkov
Author affiliations

Computer Science Department, University of Jaen, Campus Las Lagunillas s, 23071, Jaen, Spain;

Computer Science Department, University of Jaen, Spain;

Computer Science Department, University of Jaen, Spain;

Research Institute for Information and Language Processing, University of Wolverhampton, UK;

Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Keywords
Opinion mining; polarity classification; sentiment analysis; short text analysis; social networks; Spanish corpus; Twitter;
