首页> 外文会议>IEEE/ACS International Conference on Computer Systems and Applications >Developing Turkish sentiment lexicon for sentiment analysis using online news media
【24h】

Developing Turkish sentiment lexicon for sentiment analysis using online news media

机译:使用在线新闻媒体开发土耳其情绪词典进行情绪分析

获取原文

摘要

Internet is a very rich resource of documents that need to be analysed to extract their sentimental values. Sentiment Analysis which is a subfield of Natural Language Processing discipline focuses on this issue. The existence of sentiment lexicons in their own language is a very important resource for scientists studying in sentiment analysis field. Since many studies of sentiment analysis have been conducted on text written in English language, developed methods and resources for English may not produce the desired results in other languages. In Turkish, a rich sentiment lexicon does not exists, such as SentiWordNet for English. In this study, we aimed to develop Turkish sentiment lexicon, and we enhanced an existing lexicon which has 27K Turkish words to 37K words. For quantifying the performance of this enhanced lexicon, we tested both lexicons on domain independent news texts. The accuracy of determining the polarity of news written in Turkish has been increased from 60.6% to 72.2%.
机译:互联网是一种非常丰富的文件资源,需要分析以提取灵感值。作为自然语言处理纪律的子领域的情感分析侧重于此问题。以自己的语言存在情绪词典是在情绪分析领域进行学习的科学家的一个非常重要的资源。由于对语言编写的文本进行了许多情感分析,因此开发的英语方法和资源可能不会以其他语言产生预期的结果。在土耳其语中,丰富的情绪词典不存在,例如SentiWordNet用于英语。在这项研究中,我们旨在开发土耳其情绪词典,我们加强了一个现有的词典,其中有27k的土耳其语给37k字。为了量化此增强型词典的性能,我们在域独立新闻文本上测试了lexicons。确定用土耳其语编写的新闻极性的准确性从60.6 %增加到72.2 %。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号