A Twitter Corpus and Benchmark Resources for German Sentiment Analysis

机译：Twitter语料库和用于德国情绪分析的基准资源

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we present SB10k, a new corpus for sentiment analysis with approx. 10,000 German tweets. We use this new corpus and two existing corpora to provide state-of-the-art benchmarks for sentiment analysis in German: we implemented a CNN (based on the winning system of SemEval-2016) and a feature-based SVM and compare their performance on all three corpora. For the CNN, we also created German word embeddings trained on 300M tweets. These word embeddings were then optimized for sentiment analysis using distant-supervised learning. The new corpus, the German word embeddings (plain and optimized), and source code to re-run the benchmarks are publicly available.

机译：在本文中，我们介绍了SB10k，这是一种用于情感分析的新语料库，大约有。 10,000条德国推文。我们使用这个新的语料库和两个现有的语料库来提供德语情感分析的最新基准：我们实施了CNN（基于SemEval-2016的获奖系统）和基于功能的SVM，并比较了它们的性能在所有三个语料库上。对于CNN，我们还创建了在3亿条推文上训练的德语单词嵌入。然后，使用远程监督学习对这些词嵌入进行优化以进行情感分析。新的语料库，德语单词嵌入（纯文本和优化的）以及重新运行基准测试的源代码是公开可用的。

著录项

来源
《Fifth international workshop on natural language processing for social media》|2017年|45-51|共7页
会议地点 Valencia(ES)
作者
Mark Cieliebak; Jan Deriu; Dominic Egger; Fatih Uzdilli;
展开▼
作者单位

SpinningBytes;

Zurich University of Applied Sciences;

Zurich University of Applied Sciences;

Zurich University of Applied Sciences;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. The State-of-the-Art in Twitter Sentiment Analysis: A Review and Benchmark Evaluation [J] . DAVID ZIM B RA, AHMED ABBASI, DANIEL ZENG, ACM Transactions on Management Information Systems . 2018,第2期

机译：Twitter情绪分析中的最新技术：回顾和基准评估
2. Sentiment Analysis of Twitter Audiences: Measuring the Positive or Negative Influence of Popular Twitterers [J] . Younggue Bae, Hongchul Lee Journal of the American Society for Information Science and Technology . 2012,第12期

机译：Twitter观众的情感分析：衡量受欢迎的Twitter员工的正面或负面影响
3. Sentiment mapping: point pattern analysis of sentiment classified Twitter data [J] . Camacho Ken, Portelli Raechel, Shortridge Ashton, Cartography and geographic information science . 2021,第3期

机译：情绪映射：情绪的点模式分析分类推特数据
4. Twitter corpus of Resource-Scarce Languages for Sentiment Analysis and Multilingual Emoji Prediction [C] . Nurendra Choudhary, Rajat Singh, Vijjini Anvesh Rao, International conference on computational linguistics . 2018

机译：资源稀缺语言的Twitter语料库，用于情感分析和多语言表情符号预测
5. A New Opinion Review Methodology for Arabic Twitter Sentiment Analysis [D] . Almalki, Abdullah S. 2020

机译：阿拉伯语推特情绪分析的新意见审查方法
6. An Effective BERT-Based Pipeline for Twitter Sentiment Analysis: A Case Study in Italian [O] . Marco Pota, Mirko Ventura, Rosario Catelli, 2021

机译：一种有效的基于伯特语的管道用于Twitter情绪分析 - 以意大利语为例
7. A Twitter corpus and benchmark resources for german sentiment analysis [O] . Cieliebak Mark, Deriu Jan Milan, Egger Dominic, 2017

机译：Twitter语料库和用于德国情绪分析的基准资源

A Twitter Corpus and Benchmark Resources for German Sentiment Analysis

摘要

著录项

相似文献

相关主题

期刊订阅