【24h】

Active Learning for Imbalanced Sentiment Classification

机译:主动学习,情感分类失衡

获取原文

摘要

Active learning is a promising way for sentiment classification to reduce the annotation cost. In this paper, we focus on the imbalanced class distribution scenario for sentiment classification, wherein the number of positive samples is quite different from that of negative samples. This scenario posits new challenges to active learning. To address these challenges, we propose a novel active learning approach, named co-selecting, by taking both the imbalanced class distribution issue and uncertainty into account. Specifically, our co-selecting approach employs two feature subspace classifiers to collectively select most informative minority-class samples for manual annotation by leveraging a certainty measurement and an uncertainty measurement, and in the meanwhile, automatically label most informative majority-class samples, to reduce human-annotation efforts. Extensive experiments across four domains demonstrate great potential and effectiveness of our proposed co-selecting approach to active learning for imbalanced sentiment classification.
机译:主动学习是一种情感分类的有前途的方式,可以降低注释成本。在本文中,我们集中于情感分类的不平衡类别分布场景,其中正样本的数量与负样本的数量有很大差异。这种情况对主动学习提出了新的挑战。为了应对这些挑战,我们提出了一种新颖的主动学习方法,即选课,要兼顾班级分配不平衡和不确定性。具体来说,我们的共选方法利用两个特征子空间分类器,通过利用确定性度量和不确定性度量来共同选择信息量最大的少数类别样本进行手动注释,同时自动标记大多数信息的少数类别样本以减少人工注释的工作。跨越四个领域的广泛实验证明了我们提出的主动选择学习方法在情感分类失衡方面的巨大潜力和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号