【24h】

Overview of the Fourth Social Media Mining for Health (#SMM4H) Shared Task at ACL 2019

机译:ACL 2019第四次健康社交媒体挖掘(#SMM4H)共享任务概述

获取原文

摘要

The numher of users of social media continues to grow, with nearly half of adults worldwide and two-thirds of all American adults using social networking on a regular basis~1. Advances in automated data processing and NLP present the possibility of utilizing this massive data source for biomedical and public health applications, if researchers address the methodological challenges unique to this media. We present the Social Media Mining for Health Shared Tasks collocated with the ACL at Florence in 2019, which address these challenges for health monitoring and surveillance, utilizing state of the art techniques for processing noisy, real-world, and substantially creative language expressions from social media users. For the fourth execution of this challenge, we proposed four different tasks. Task 1 asked participants to distinguish tweets reporting an adverse drug reaction (ADR) from those that do not. Task 2, a follow-up to Task 1, asked participants to identify the span of text in tweets reporting ADRs. Task 3 is an end-to-end task where the goal was to first detect tweets mentioning an ADR and then map the extracted colloquial mentions of ADRs in the tweets to their corresponding standard concept IDs in the MedDRA vocabulary. Finally. Task 4 asked participants to classify whether a tweet contains a personal mention of one's health, a more general discussion of the health issue, or is an unrelated mention. A total of 34 teams from around the world registered and 19 teams from 12 countries submitted a system run. We summarize here the corpora for this challenge which are freely available at https://competitions.codalab. org/competitions/22521, and present an overview of the methods and the results of the competing systems.
机译:社交媒体用户的数量继续增长,全球近一半的成年人和三分之二的美国成年人定期使用社交网络[1]。如果研究人员解决了这种媒体特有的方法挑战,那么自动化数据处理和NLP的进步就意味着可以将这种庞大的数据源用于生物医学和公共卫生应用。我们将展示与ACL于2019年在佛罗伦萨联合部署的用于健康共享任务的社交媒体挖掘,这些任务将利用最先进的技术来处理社交网络中的嘈杂,真实世界和实质性创造性的语言表达,从而应对健康监控和监视方面的这些挑战媒体用户。对于此挑战的第四次执行,我们提出了四个不同的任务。任务1要求参与者将报告不良药物反应(ADR)的推文与未报告的推文区分开。任务2是任务1的后续活动,要求参与者确定报告ADR的推文中的文本范围。任务3是端到端的任务,目标是首先检测提及ADR的推文,然后将推文中提取的ADR口语提述映射到MedDRA词汇表中的相应标准概念ID。最后。任务4要求参与者对一条推文进行分类,其中一条推文是否包含个人对健康的提及,对健康问题的更一般性讨论,还是不相关的提及。来自全球的34个团队进行了注册,来自12个国家的19个团队进行了系统运行。我们在这里总结了此挑战的语料库,可从https://competitions.codalab免费获得。 org / competitions / 22521,并概述了竞争系统的方法和结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号