...
首页> 外文期刊>Journal of the American Society for Information Science >Exploring Co-Training Strategies for Opinion Detection
【24h】

Exploring Co-Training Strategies for Opinion Detection

机译:探索意见侦查的共同训练策略

获取原文
获取原文并翻译 | 示例

摘要

For the last decade or so, sentiment analysis, which aims to automatically identify opinions, polarities, or emotions from user-generated content (e.g., blogs, tweets), has attracted interest from both academic and industrial communities. Most sentiment analysis strategies fall into 2 categories: lexicon-based and corpus-based approaches. While the latter often requires sentiment-labeled data to build a machine learning model, both approaches need sentiment-labeled data for evaluation. Unfortunately, most data domains lack sufficient quantities of labeled data, especially at the subdocument level. Semisupervised learning (SSL), a machine learning technique that requires only a few labeled examples and can automatically label unlabeled data, is a promising strategy to deal with the issue of insufficient labeled data. Although previous studies have shown promising results of applying various SSL algorithms to solve a sentiment-analysis problem, co-training, an SSL algorithm, has not attracted much attention for sentiment analysis largely due to its restricted assumptions. Therefore, this study focuses on revisiting co-training in depth and discusses several co-training strategies for sentiment analysis following a loose assumption. Results suggest that co-training can be more effective than can other currently adopted SSL methods for sentiment analysis.
机译:在过去的十年左右的时间里,旨在从用户生成的内容(例如,博客,推文)中自动识别观点,极性或情感的情感分析引起了学术界和工业界的关注。大多数情感分析策略分为两类:基于词典的方法和基于语料库的方法。尽管后者通常需要带有情感标签的数据来构建机器学习模型,但两种方法都需要带有情感标签的数据来进行评估。不幸的是,大多数数据域缺少足够数量的标记数据,尤其是在子文档级别。半监督学习(SSL)是一种机器学习技术,它仅需要几个带有标签的示例,并且可以自动为未标签的数据添加标签,这是解决标签数据不足问题的一种有前途的策略。尽管以前的研究表明,应用各种SSL算法解决情感分析问题的结果令人鼓舞,但是由于SSL算法的局限性,共训练SSL算法并未引起人们对情感分析的广泛关注。因此,本研究的重点是重新探讨深度共训练,并讨论了一个宽松的假设之后用于情感分析的几种共训练策略。结果表明,对于情绪分析,共同训练可能比其他当前采用的SSL方法更有效。

著录项

  • 来源
  • 作者

    Ning Yu;

  • 作者单位

    School of Library and Information Science, University of Kentucky, 329 Little Library Building, Lexington, KY 40506-0224;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号