首页> 外文会议>IEEE/ACS International Conference on Computer Systems and Applications >Developing Turkish sentiment lexicon for sentiment analysis using online news media
【24h】

Developing Turkish sentiment lexicon for sentiment analysis using online news media

机译:使用在线新闻媒体开发用于分析情感的土耳其语情感词典

获取原文

摘要

Internet is a very rich resource of documents that need to be analysed to extract their sentimental values. Sentiment Analysis which is a subfield of Natural Language Processing discipline focuses on this issue. The existence of sentiment lexicons in their own language is a very important resource for scientists studying in sentiment analysis field. Since many studies of sentiment analysis have been conducted on text written in English language, developed methods and resources for English may not produce the desired results in other languages. In Turkish, a rich sentiment lexicon does not exists, such as SentiWordNet for English. In this study, we aimed to develop Turkish sentiment lexicon, and we enhanced an existing lexicon which has 27K Turkish words to 37K words. For quantifying the performance of this enhanced lexicon, we tested both lexicons on domain independent news texts. The accuracy of determining the polarity of news written in Turkish has been increased from 60.6% to 72.2%.
机译:互联网是非常丰富的文档资源,需要对其进行分析以提取其情感价值。情感分析是自然语言处理学科的一个子领域,着重于此问题。以自己的语言存在的情感词典对于在情感分析领域进行研究的科学家来说是非常重要的资源。由于已经对用英语书写的文本进行了情感分析的许多研究,因此英语开发的方法和资源可能无法用其他语言产生预期的结果。在土耳其语中,不存在丰富的情感词典,例如英语的SentiWordNet。在这项研究中,我们旨在开发土耳其语情感词典,并将现有的具有27K土耳其语单词的词典增强为37K单词。为了量化此增强词典的性能,我们在独立于域的新闻文本上测试了这两个词典。确定土耳其语新闻极性的准确性从60.6%提高到72.2%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号