Building Sentiment Lexicons for All Major Languages

机译：为所有主要语言建立情感词典

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sentiment analysis in a multilingual world remains a challenging problem, because developing language-specific sentiment lexicons is an extremely resource-intensive process. Such lexicons remain a scarce resource for most languages. In this paper, we address this lexicon gap by building high-quality sentiment lexicons for 136 major languages. We integrate a variety of linguistic resources to produce an immense knowledge graph. By appropriately propagating from seed words, we construct sentiment lexicons for each component language of our graph. Our lexicons have a polarity agreement of 95.7% with published lexicons, while achieving an overall coverage of 45.2%. We demonstrate the performance of our lexicons in an extrinsic analysis of 2,000 distinct historical figures' Wikipedia articles on 30 languages. Despite cultural difference and the intended neutrality of Wikipedia articles, our lexicons show an average sentiment correlation of 0.28 across all language pairs.

机译：在多语言世界中，情感分析仍然是一个具有挑战性的问题，因为开发特定于语言的情感词典是一个非常耗费资源的过程。对于大多数语言而言，此类词典仍然是稀缺资源。在本文中，我们通过为136种主要语言构建高质量的情感词典来解决此词典差距。我们整合了各种语言资源，以产生巨大的知识图。通过从种子词中适当传播，我们为图形的每种组成语言构造了情感词典。我们的词典与已发布的词典的极性一致性为95.7％，而总体覆盖率为45.2％。我们通过对2,000种不同的历史人物的Wikipedia文章对30种语言的外在分析，证明了词典的性能。尽管文化差异和Wikipedia文章的预期中立性，我们的词典显示所有语言对的平均情感相关性为0.28。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2014年|383-389|共7页
会议地点
作者
Yanqing Chen; Steven Skiena;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Creating sentiment lexicon for sentiment analysis in Urdu: The case of a resource‐poor language [J] . Asghar Muhammad Zubair, Sattar Anum, Khan Aurangzeb, Expert Systems . 2019,第3期

机译：在乌尔都语中创建用于情感分析的情感词典：资源贫乏的语言案例
2. Creating sentiment lexicon for sentiment analysis in Urdu: The case of a resource‐poor language [J] . Asghar Muhammad Zubair, Sattar Anum, Khan Aurangzeb, Expert Systems . 2019,第3期

机译：在乌尔都语中创造情绪词典的情绪分析：资源差的语言
3. Sentiment lexicons and non-English languages: a survey [J] . Kaity Mohammed, Balakrishnan Vimala Knowledge and information systems . 2020,第12期

机译：情绪词典和非英语语言：调查
4. Building Sentiment Lexicons for All Major Languages [C] . Yanqing Chen, Steven Skiena Annual meeting of the Association for Computational Linguistics . 2014

机译：为所有主要语言构建情商词典
5. Towards Automated Domain-Oriented Lexicon Construction and Dimension Reduction for Arabic Sentiment Analysis [D] . Alshahrani, Hasan A. 2018

机译：面向阿拉伯语情感分析的面向领域的自动词典构建和降维
6. Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias [O] . Hongyu Han, Yongshi Zhang, Jianpei Zhang, 2012

机译：通过减少其他引入的情感偏见来提高基于词典的评论情感分析方法的性能
7. Building Sentiment Lexicons for All Major Languages [O] . Yanqing Chen, Steven Skiena 2015

机译：为所有主要语言构建情感词典

Building Sentiment Lexicons for All Major Languages

摘要

著录项

相似文献

相关主题

期刊订阅