首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >Predictive modelling of stigmatized behaviour in vaccination discussions on Facebook
【24h】

Predictive modelling of stigmatized behaviour in vaccination discussions on Facebook

机译:Facebook疫苗接种讨论中污名化行为的预测模型

获取原文

摘要

Facebook often serves as a platform for sharing health-related information and is a venue to express attitudes, thoughts, and frustrations within groups centered around healthcare themes. This information can be utilized for public health monitoring, with the aim of tackling stigmatized and stereotypical attitudes in relation to immunization or other health related issues expressed in social media. However, the effectiveness of those attempts will rest on our understanding of the concept of stigma and its correct modeling. In this study, we aim to expand the small pool of existing computational studies on the topic of stigma identification in a health care context. More specifically, we compare the following models using a dataset of 2,761 comments from Facebook: Convolutional Neural Network (CNN): Term Frequency-Inverse Document Frequency (TF-IDF) with Logistic Regression (LR), Support Vector Machine (SVM), Naive Bayes (NB), Multilayer Perceptron (MLP), Random Forest (RF), K-nearest neighbours (KNN), and Stochastic Gradient Descent (SGDC), Long short-term memory networks (LSTM), Bidirectional long short-term memory (BiLSTM), and fastText. Accuracy results as evaluated on an unbalanced data subset (with limited training samples) show that fastText gives the best performance, although BiLSTM and CNN achieve comparably good results on unbalanced data as well. CNN algorithm significantly outperforms other algorithms on balanced version of the dataset according to a paired sample t-test ( ).
机译:Facebook经常充当共享与健康相关信息的平台,并且是一个以医疗保健主题为中心表达态度,思想和挫败感的场所。该信息可用于公共健康监测,目的是解决在社交媒体上表达的与免疫或其他与健康相关的问题有关的污名化和陈规定型态度。但是,这些尝试的有效性将取决于我们对柱头概念及其正确建模的理解。在这项研究中,我们的目标是在医疗保健环境中扩大关于耻辱识别主题的现有计算研究的一小部分。更具体地说,我们使用来自Facebook的2,761条评论的数据集比较以下模型:卷积神经网络(CNN):带有Logistic回归(LR)的词频逆文档频率(TF-IDF),支持向量机(SVM),朴素贝叶斯(NB),多层感知器(MLP),随机森林(RF),K近邻(KNN)和随机梯度下降(SGDC),长短期记忆网络(LSTM),双向长短期记忆( BiLSTM)和fastText。对不平衡数据子集(具有有限的训练样本)进行评估的准确性结果表明,尽管BiLSTM和CNN在不平衡数据上也取得了相当不错的结果,但fastText的性能最佳。根据配对样本t检验(),CNN算法在数据集的平衡版本上明显优于其他算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号