【24h】

UACH at SMM4H: a BERT based approach for classification of COVID-19 Twitter posts

机译:SMA4H UACH:基于伯特的COVID-19推特分类方法

获取原文

摘要

This work describes the participation of the Universidad Autonoma de Chihuahua team at the Social Media Mining for Health Applications (SMM4H) 2021 shared task. Our team participated in Tasks 5 and 6, both focused on the automatic classification of tweets related to COVID-19. Task 5 considered a binary classification problem, aiming to identify self-reporting tweets of potential cases of COVID-19. On the other hand. Task 6 goal was to classify tweets containing COVID-19 symptoms. For both tasks we used models based on bidirectional encoder representations from transformers (BERT). Our objective was to determine whether a model trained on a corpus from the domain of interest could outperformed one trained on a much larger general domain corpus. Our F1 results were encouraging, 0.77 and 0.95 for Tasks 5 and 6 respectively, having achieved the highest score among all the participants in the latter.
机译:这项工作描述了Unrimediad AutoCAD奇瓦瓦团队在社交媒体挖掘中的健康应用(SMM4H)2021共享任务的参与。我们的团队参与了任务5和6,他们都专注于与COVID-19相关的鸣叫的自动分类。任务5考虑了2019冠状病毒疾病的二分类问题,旨在识别潜在的COVID-19病例的报告。另一方面任务6的目标2019冠状病毒疾病的分类。对于这两项任务,我们都使用了基于变压器(BERT)双向编码器表示的模型。我们的目标是确定在感兴趣领域的语料库上训练的模型是否比在更大的一般领域语料库上训练的模型表现更好。我们的F1成绩令人鼓舞,任务5和任务6的成绩分别为0.77和0.95,在后者的所有参与者中得分最高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号