首页> 美国卫生研究院文献>Genomics Informatics >PharmacoNER Tagger: a deep learning-based tool for automatically finding chemicals and drugs in Spanish medical texts
【2h】

PharmacoNER Tagger: a deep learning-based tool for automatically finding chemicals and drugs in Spanish medical texts

机译:PharmacoNER Tagger:一种基于深度学习的工具可在西班牙医学文本中自动查找化学药品和药物

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Automatically detecting mentions of pharmaceutical drugs and chemical substances is key for the subsequent extraction of relations of chemicals with other biomedical entities such as genes, proteins, diseases, adverse reactions or symptoms. The identification of drug mentions is also a prior step for complex event types such as drug dosage recognition, duration of medical treatments or drug repurposing. Formally, this task is known as named entity recognition (NER), meaning automatically identifying mentions of predefined entities of interest in running text. In the domain of medical texts, for chemical entity recognition (CER), techniques based on hand-crafted rules and graph-based models can provide adequate performance. In the recent years, the field of natural language processing has mainly pivoted to deep learning and state-of-the-art results for most tasks involving natural language are usually obtained with artificial neural networks. Competitive resources for drug name recognition in English medical texts are already available and heavily used, while for other languages such as Spanish these tools, although clearly needed were missing. In this work, we adapt an existing neural NER system, NeuroNER, to the particular domain of Spanish clinical case texts, and extend the neural network to be able to take into account additional features apart from the plain text. NeuroNER can be considered a competitive baseline system for Spanish drug and CER promoted by the Spanish national plan for the advancement of language technologies (Plan TL).
机译:自动检测药物和化学物质的提法对于随后提取化学物质与其他生物医学实体(例如基因,蛋白质,疾病,不良反应或症状)之间的关系至关重要。对于复杂事件类型,例如药物剂量识别,药物治疗持续时间或药物再利用,识别药物提及也是先前的步骤。正式地,此任务称为命名实体识别(NER),即自动识别运行文本中感兴趣的预定义实体的提及。在医学文本领域,对于化学实体识别(CER),基于手工规则和基于图形的模型的技术可以提供足够的性能。近年来,自然语言处理领域主要转向了深度学习,大多数涉及自然语言的任务的最新结果通常是通过人工神经网络获得的。英文医学文本中用于药物名称识别的竞争性资源已经可用并大量使用,而对于其他语言(如西班牙文),这些工具虽然显然没有必要,但却缺少。在这项工作中,我们将现有的神经NER系统NeuroNER改编为西班牙临床案例文本的特定领域,并扩展了神经网络以能够考虑除纯文本之外的其他功能。西班牙国家语言技术发展计划(计划TL)倡导将NeuroNER视为西班牙药品和CER的竞争基准系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号