首页> 美国卫生研究院文献>AMIA Summits on Translational Science Proceedings >Automatically Identifying Comparator Groups on Twitter for Digital Epidemiology of Pregnancy Outcomes
【2h】

Automatically Identifying Comparator Groups on Twitter for Digital Epidemiology of Pregnancy Outcomes

机译:在Twitter上自动识别比较组以了解妊娠结局的数字流行病学

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Despite the prevalence of adverse pregnancy outcomes such as miscarriage, stillbirth, birth defects, and preterm birth, their causes are largely unknown. We seek to advance the use of social media for observational studies of pregnancy outcomes by developing a natural language processing pipeline for automatically identifying users from which to select comparator groups on Twitter. We annotated 2361 tweets by users who have announced their pregnancy on Twitter, which were used to train and evaluate supervised machine learning algorithms as a basis for automatically detecting women who have reported that their pregnancy had reached term and their baby was born at a normal weight. Upon further processing the tweet-level predictions of a majority voting-based ensemble classifier, the pipeline achieved a user-level F1-score of 0.933 (precision = 0.947, recall = 0.920). Our pipeline will be deployed to identify large comparator groups for studying pregnancy outcomes on Twitter.
机译:尽管普遍存在不利的妊娠结局,例如流产,死产,先天缺陷和早产,但其成因很大程度上未知。我们通过开发一种自然语言处理管道来自动识别在Twitter上选择比较者组的用户,从而寻求将社交媒体用于妊娠结局的观察研究。我们为在Twitter上宣布怀孕的用户添加了2361条推文,这些推文被用来训练和评估监督的机器学习算法,以此作为自动检测报告其妊娠已满并且婴儿出生体重正常的女性的基础。在进一步处理基于多数投票的集成分类器的推特级别的预测后,管道实现了0.933的用户级别F1分数(精度= 0.947,召回率= 0.920)。我们的渠道将被部署以识别大型比较者组,以便在Twitter上研究妊娠结局。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号