首页> 外文会议>International conference on computer processing of oriental languages >An Empirical Study on Chinese Microblog Stance Detection Using Supervised and Semi-supervised Machine Learning Methods
【24h】

An Empirical Study on Chinese Microblog Stance Detection Using Supervised and Semi-supervised Machine Learning Methods

机译:使用监督和半监控机器学习方法对中国微博立场检测的实证研究

获取原文

摘要

Nowadays, more and more people are willing to express their opinions and attitudes in the microblog platform. Stance detection refers to the task that judging whether the author of the text is in favor of or against the given target. Most of the existing literature are for the debates or online conversations, which have adequate context for inferring the authors' stances. However, for detecting the stance in microblogs, we have to figure out the stance of the author only based on the unique and separate microblog, which sets new obstacles for this task. In this paper, we conduct a comprehensive empirical study on microblog stance detection using supervised and semi-supervised machine learning methods. Different unbalanced data processing strategies and classifiers, such as Linear SVM, Naive Bayes and Random Forest, are compared using NLPCC 2016 Stance Detection Evaluation Task dataset. Experiment results show that the method based on ensemble learning and SMOTE2 unbalanced processing with sentiment word features outperforms the best submission result in NLPCC 2016 Evaluation Task.
机译:如今,越来越多的人愿意在微博平台上表达他们的意见和态度。姿态检测是指判断文本作者是否赞成或反对给定目标的任务。现有的大多数文献都是为了辩论或在线对话,这对于推断作者的立场具有足够的背景。但是,对于检测到微博中的立场,我们必须仅基于独特和独立的微博来弄清楚作者的立场,这为此任务设置了新的障碍。在本文中,我们使用监督和半监督机器学习方法对微博立场检测进行了全面的实证研究。使用NLPCC 2016 Stance检测评估任务数据集进行比较不同的不平衡数据处理策略和分类器,例如线性SVM,Naive Bayes和随机林。实验结果表明,基于集合学习的方法和情绪词的阐述不平衡处理概率优于NLPCC 2016评估任务的最佳提交结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号