首页> 外文期刊>JMIR Medical Informatics >Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure
【24h】

Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure

机译:利用生物医学文献进行不良药物事件发现:大数据神经网络历险记

获取原文
       

摘要

Background The study of adverse drug events (ADEs) is a tenured topic in medical literature. In recent years, increasing numbers of scientific articles and health-related social media posts have been generated and shared daily, albeit with very limited use for ADE study and with little known about the content with respect to ADEs. Objective The aim of this study was to develop a big data analytics strategy that mines the content of scientific articles and health-related Web-based social media to detect and identify ADEs. Methods We analyzed the following two data sources: (1) biomedical articles and (2) health-related social media blog posts. We developed an intelligent and scalable text mining solution on big data infrastructures composed of Apache Spark, natural language processing, and machine learning. This was combined with an Elasticsearch No-SQL distributed database to explore and visualize ADEs. Results The accuracy, precision, recall, and area under receiver operating characteristic of the system were 92.7%, 93.6%, 93.0%, and 0.905, respectively, and showed better results in comparison with traditional approaches in the literature. This work not only detected and classified ADE sentences from big data biomedical literature but also scientifically visualized ADE interactions. Conclusions To the best of our knowledge, this work is the first to investigate a big data machine learning strategy for ADE discovery on massive datasets downloaded from PubMed Central and social media. This contribution illustrates possible capacities in big data biomedical text analysis using advanced computational methods with real-time update from new data published on a daily basis.
机译:背景技术药物不良事件(ADEs)的研究是医学文献中的一个长期话题。近年来,每天生成和共享越来越多的科学文章和与健康有关的社交媒体帖子,尽管在ADE研究中使用的用途非常有限,并且对ADE的内容知之甚少。目的这项研究的目的是开发一种大数据分析策略,以挖掘科学文章和与健康相关的基于Web的社交媒体的内容,以检测和识别ADE。方法我们分析了以下两个数据源:(1)生物医学文章和(2)与健康相关的社交媒体博客文章。我们在由Apache Spark,自然语言处理和机器学习组成的大数据基础架构上开发了一种智能且可扩展的文本挖掘解决方案。结合使用Elasticsearch No-SQL分布式数据库来探索和可视化ADE。结果该系统的准确度,精确度,召回率和接收器工作面积分别为92.7%,93.6%,93.0%和0.905,与传统方法相比,显示出更好的结果。这项工作不仅从大数据生物医学文献中检测和分类了ADE句子,而且还科学地可视化了ADE交互作用。结论据我们所知,这项工作是第一个研究大数据机器学习策略的方法,该策略用于从PubMed Central和社交媒体下载的海量数据集上进行ADE发现。该贡献说明了使用先进的计算方法并根据每天发布的新数据进行实时更新的大数据生物医学文本分析的可能能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号