Conference: MEDINFO

Finding Reasons for Vaccination Hesitancy: Evaluating Semi-Automatic Coding of Internet Discussion Forums


Abstract

Computer-assisted text coding can facilitate the analysis of large text collections. To evaluate the functionality of providing an analyst with a ranked list of suggestions for suitable text codes, we used a data set of discussion posts, which had been manually coded for reasons given for taking a stance on the topic of vaccination. We trained a logistic regression classifier to rank these reasons according to the probability that they would be present in the post. The approach was evaluated for its ability to include the expected reasons among the n top-ranked reasons, using an n between 1 and 6. The logistic regression-based ranking was more effective than the baseline, which ranked reasons according to their frequency in the training data. Providing such a list of possible codes, ranked by logistic regression, could therefore be a useful feature in a tool for text coding.
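
The evaluation described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it reads "include the expected reasons among the n top-ranked reasons" as the fraction of manually coded reasons that appear in the top n, which is only one possible interpretation. The reason names, the toy posts, and the per-post probabilities (standing in for the output of a trained logistic regression classifier) are all invented for the example, as are the function names.

```python
from collections import Counter


def frequency_ranking(training_labels):
    """Baseline: rank reasons by how often they occur in the training data."""
    counts = Counter(reason for labels in training_labels for reason in labels)
    # Sort by descending count, then by name so that ties are deterministic.
    return sorted(counts, key=lambda r: (-counts[r], r))


def probability_ranking(probabilities):
    """Rank reasons for one post by descending predicted probability."""
    return sorted(probabilities, key=lambda r: (-probabilities[r], r))


def top_n_recall(rankings, gold_labels, n):
    """Fraction of expected reasons found among the n top-ranked reasons."""
    found = total = 0
    for ranking, gold in zip(rankings, gold_labels):
        found += len(set(ranking[:n]) & gold)
        total += len(gold)
    return found / total if total else 0.0


# Toy data, invented for illustration only.
train_labels = [{"safety"}, {"safety", "trust"}, {"efficacy"}, {"safety"}, {"trust"}]
test_labels = [{"efficacy"}, {"trust", "safety"}]

# Hypothetical per-post probabilities, standing in for classifier output.
test_probs = [
    {"safety": 0.2, "trust": 0.1, "efficacy": 0.7},
    {"safety": 0.6, "trust": 0.5, "efficacy": 0.1},
]

baseline = frequency_ranking(train_labels)
baseline_rankings = [baseline] * len(test_labels)  # same list for every post
model_rankings = [probability_ranking(p) for p in test_probs]

# Map each n to (baseline score, probability-based score).
results = {
    n: (
        top_n_recall(baseline_rankings, test_labels, n),
        top_n_recall(model_rankings, test_labels, n),
    )
    for n in range(1, 4)
}

for n, (base, model) in results.items():
    print(f"n={n}: baseline={base:.2f} probability-ranked={model:.2f}")
```

The baseline proposes the same list for every post, whereas the probability-based ranking is specific to each post. That difference is why a per-post ranking can place the expected reasons higher for small n.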
