首页> 外文期刊>Journal of Information Science >A weakly supervised approach to Chinese sentiment classification using partitioned self-training
【24h】

A weakly supervised approach to Chinese sentiment classification using partitioned self-training

机译:基于分区自我训练的弱监督汉语情感分类方法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

With the rapid evolution of documents on the World Wide Web which express opinions, there exists an increasing demand for developing such a sentiment analysis technique that can easily adapt to new domains with minimum supervision. This article introduces a novel weakly supervised approach for Chinese sentiment classification. The approach applies a variant of self-training algorithm on two partitions split from test dataset, and combines classification results of the two partitions into a pseudo-labelled training set and an unlabelled test set, then trains an initial classifier on the pseudo-labelled training set and adopts a standard self-learning cycle to obtain the overall classification results. Experiments on the four datasets from two domains show that our approach has competitive advantages over baseline approaches; it even outperforms the supervised approach in some of the datasets despite using no labelled documents.
机译:随着表达意见的万维网上文档的快速发展,对开发这样一种情感分析技术的需求日益增加,该技术可以在最少的监督下轻松地适应新的领域。本文介绍了一种新颖的弱监督汉语情感分类方法。该方法将自训练算法的变体应用于从测试数据集拆分的两个分区,并将两个分区的分类结果组合为伪标记的训练集和未标记的测试集,然后在伪标记的训练中训练初始分类器设定并采用标准的自学周期以获得总体分类结果。来自两个领域的四个数据集的实验表明,我们的方法比基线方法具有竞争优势。即使不使用带标签的文档,它甚至在某些数据集中也优于监督方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号