...
首页> 外文期刊>International journal of medical informatics >Estimation of the prevalence of adverse drug reactions from social media
【24h】

Estimation of the prevalence of adverse drug reactions from social media

机译:估计社交媒体中药物不良反应的发生率

获取原文
获取原文并翻译 | 示例
           

摘要

This work aims to estimate the degree of adverse drug reactions (ADR) for psychiatric medications from social media, including Twitter, Reddit, and LiveJournal. Advances in lightning-fast cluster computing was employed to process large scale data, consisting of 6.4 terabytes of data containing 3.8 billion records from all the media. Rates of ADR were quantified using the SIDER database of drugs and side-effects, and an estimated ADR rate was based on the prevalence of discussion in the social media corpora. Agreement between these measures for a sample of ten popular psychiatric drugs was evaluated using the Pearson correlation coefficient, r, with values between 0.08 and 0.50. Word2vec, a novel neural learning framework, was utilized to improve the coverage of variants of ADR terms in the unstructured text by identifying syntactically or semantically similar terms. Improved correlation coefficients, between 0.29 and 0.59, demonstrates the capability of advanced techniques in machine learning to aid in the discovery of meaningful patterns from medical data, and social media data, at scale. (C) 2017 Elsevier B.V. All rights reserved.
机译:这项工作旨在通过社交媒体(包括Twitter,Reddit和LiveJournal)估计精神药物的不良药物反应(ADR)程度。利用快如闪电的集群计算技术来处理大规模数据,该数据由6.4 TB的数据组成,其中包含来自所有媒体的38亿条记录。使用药物和副作用的SIDER数据库量化了ADR的发生率,而估计的ADR发生率则是基于社交媒体语料库中讨论的普遍性。使用皮尔森相关系数r(值介于0.08和0.50之间)评估了十种流行精神药物样本在这些措施之间的一致性。 Word2vec是一种新型的神经学习框架,用于通过识别语法或语义上相似的术语来改善非结构化文本中ADR术语变体的覆盖范围。改进的相关系数在0.29至0.59之间,证明了机器学习中先进技术的能力,可以帮助从医学数据和社交媒体数据中大规模发现有意义的模式。 (C)2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号