SentiWordNet for Indian Languages

Abstract

Sentiment analysis is the discipline that identifies and classifies sentiment, opinion, and emotion in human-written text. A typical computational approach to sentiment analysis starts with a prior polarity lexicon, in which each entry is tagged with its prior, out-of-context polarity as human beings perceive it using their cognitive knowledge. To date, the research efforts found in the sentiment lexicon literature deal mostly with English text. In this article, we propose multiple computational techniques (WordNet-based, dictionary-based, corpus-based, and generative approaches) for generating SentiWordNet(s) for Indian languages. SentiWordNet(s) are currently being developed for three Indian languages: Bengali, Hindi, and Telugu. An intuitive online game has been developed to create and validate the SentiWordNet(s) by involving the Internet population. A number of automatic, semi-automatic, and manual validation and evaluation methodologies have been adopted to measure the coverage and credibility of the developed SentiWordNet(s).
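As a rough illustration of what a dictionary-based approach to building a prior polarity lexicon might look like, the Python sketch below projects (positive, negative) scores from a source-language lexicon onto target-language words through a bilingual dictionary. It is not the authors' method, which the abstract does not detail. The function names `project_lexicon` and `prior_polarity`, the romanized Hindi entries, and all scores are hypothetical toy values chosen only for the example.

```python
from collections import defaultdict


def project_lexicon(source_lexicon, bilingual_dict):
    """Project (pos, neg) scores from source words onto their translations.

    When several source words map to the same target word, their scores
    are averaged. This averaging rule is an assumption of this sketch.
    """
    collected = defaultdict(list)
    for src_word, scores in source_lexicon.items():
        for tgt_word in bilingual_dict.get(src_word, []):
            collected[tgt_word].append(scores)

    projected = {}
    for tgt_word, score_list in collected.items():
        n = len(score_list)
        pos = sum(s[0] for s in score_list) / n
        neg = sum(s[1] for s in score_list) / n
        projected[tgt_word] = (pos, neg)
    return projected


def prior_polarity(lexicon, word):
    """Return the out-of-context polarity label for a word."""
    pos, neg = lexicon.get(word, (0.0, 0.0))
    if pos > neg:
        return "positive"
    if neg > pos:
        return "negative"
    return "neutral"


# Toy data: made-up scores and a tiny hand-written dictionary (hypothetical).
TOY_ENGLISH = {"good": (0.75, 0.0), "bad": (0.0, 0.625), "poor": (0.0, 0.5)}
TOY_DICT = {"good": ["achchha"], "bad": ["bura", "kharab"], "poor": ["kharab"]}

hindi = project_lexicon(TOY_ENGLISH, TOY_DICT)
for word in sorted(hindi):
    print(word, hindi[word], prior_polarity(hindi, word))
```

Words missing from the projected lexicon fall back to "neutral" here, which is one simple way to expose the coverage gaps that the validation steps mentioned in the abstract are meant to measure.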
