International Workshop on Health Text Mining and Information Analysis

Identifying Personal Experience Tweets of Medication Effects Using Pre-trained RoBERTa Language Model and Its Updating



Abstract

Post-market surveillance, the practice of monitoring the safe use of pharmaceutical drugs, is an important part of pharmacovigilance. Collecting personal experiences related to pharmaceutical product use could help us gain insight into how the human body reacts to different medications. Twitter, a popular social media service, is considered an important alternative data source for collecting personal experience information about medications. Identifying personal experience tweets is a challenging classification task in natural language processing. In this study, we used three methods based on Facebook's Robustly Optimized BERT Pretraining Approach (RoBERTa) to predict personal experience tweets related to medication use: the first combines the pre-trained RoBERTa model with a classifier; the second first updates the pre-trained RoBERTa model on a corpus of unlabeled tweets and then combines it with a classifier; and the third trains a RoBERTa model from scratch on our unlabeled tweets before combining it with the classifier. Our results show that all of these approaches outperform the published method (word embedding + LSTM) in classification performance (p < 0.05), and that updating the pre-trained language model with medication-related tweets improves performance further.
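The three RoBERTa-based setups described in the abstract can be sketched with the Hugging Face `transformers` library. This is a minimal illustration, not the authors' code: the `preprocess_tweet` helper, the checkpoint names, and the choice of a two-label head are assumptions the paper does not specify.

```python
import re
from typing import Optional


def preprocess_tweet(text: str) -> str:
    """Normalize a tweet before tokenization (illustrative choices;
    the paper does not describe its preprocessing)."""
    text = re.sub(r"https?://\S+", "<url>", text)  # mask links
    text = re.sub(r"@\w+", "<user>", text)         # mask user mentions
    return re.sub(r"\s+", " ", text).strip()       # collapse whitespace


def build_classifier(checkpoint: Optional[str] = None):
    """Load RoBERTa with a binary classification head
    (personal-experience tweet vs. not).

    - Variant 1: pass nothing and use the stock 'roberta-base' weights.
    - Variant 2: pass a checkpoint produced by continued masked-LM
      pretraining on the unlabeled medication-tweet corpus.
    - Variant 3: pass a checkpoint pretrained from scratch on that corpus.

    Imports are local so the preprocessing above runs even without
    transformers installed.
    """
    from transformers import (AutoModelForSequenceClassification,
                              AutoTokenizer)
    name = checkpoint or "roberta-base"
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForSequenceClassification.from_pretrained(
        name, num_labels=2)
    return tokenizer, model
```

Variant 2's "updating" step corresponds to continued masked-language-model pretraining (e.g. with `AutoModelForMaskedLM` and the `Trainer` API) on the unlabeled tweet corpus before fine-tuning the classification head on the labeled tweets.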
