首页> 外文期刊>Journal of Information Science >Polarity classification for Spanish tweets using the COST corpus
【24h】

Polarity classification for Spanish tweets using the COST corpus

机译:使用COST语料库对西班牙推文进行极性分类

获取原文
获取原文并翻译 | 示例
           

摘要

It was not until 2010 when businesses, politicians and people in general began to realize the potential of Twitter in Spain. This fact has awoken research interest in the extraction of knowledge from Twitter. This paper aims to fill the gap of the lack of resources for Twitter sentiment analysis in Spanish by performing a study of different features and machine learning algorithms for classifying the polarity of Twitter posts. The result is a new corpus of Spanish tweets called COST, and we have carried out a wide-ranging experiment in which different machine learning algorithms have been used. Furthermore, we have tested the influence of using different weighting schemes for unigrams, the influence of eliminating stop-words and the application of a stemmer process.
机译:直到2010年,企业,政界人士和普通民众才开始意识到Twitter在西班牙的潜力。这一事实引起了人们对从Twitter提取知识的研究兴趣。本文旨在通过对不同功能和机器学习算法进行研究以对Twitter帖子的极性进行分类,以填补西班牙语中Twitter情感分析资源不足的空白。结果是西班牙推文的新语料库称为COST,并且我们进行了广泛的实验,其中使用了不同的机器学习算法。此外,我们测试了使用不同权重方案的字母组合的影响,消除停用词的影响以及词干处理的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号