首页> 外文期刊>Computational Social Networks >A review: preprocessing techniques and data augmentation for sentiment analysis
【24h】

A review: preprocessing techniques and data augmentation for sentiment analysis

机译:综述:心情分析的预处理技术和数据增强

获取原文
获取外文期刊封面目录资料

摘要

In literature, the machine learning-based studies of sentiment analysis are usually supervised learning which must have pre-labeled datasets to be large enough in certain domains. Obviously, this task is tedious, expensive and time-consuming to build, and hard to handle unseen data. This paper has approached semi-supervised learning for Vietnamese sentiment analysis which has limited datasets. We have summarized many preprocessing techniques which were performed to clean and normalize data, negation handling, intensification handling to improve the performances. Moreover, data augmentation techniques, which generate new data from the original data to enrich training data without user intervention, have also been presented. In experiments, we have performed various aspects and obtained competitive results which may motivate the next propositions.
机译:在文献中,基于机器学习的情感分析研究通常是监督学习,必须在某些域中具有预先标记的数据集足够大。显然,这项任务是繁琐的,昂贵且耗时的构建,并且难以处理看不见的数据。本文已接近具有有限数据集的越南情绪分析的半监督学习。我们已经总结了许多预处理技术,以清洁和规范化数据,否定处理,强化处理以改善性能。此外,还提出了从原始数据生成新数据以丰富无用户干预的培训数据的数据增强技术。在实验中,我们已经进行了各个方面,并获得了可能激发下一个命题的竞争结果。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号