首页> 外文会议>European Conference on Information Retrieval >Drug and Disease Interpretation Learning with Biomedical Entity Representation Transformer
【24h】

Drug and Disease Interpretation Learning with Biomedical Entity Representation Transformer

机译:生物医学实体代表变压器的药物和疾病解释学习

获取原文

摘要

Concept normalization in free-form texts is a crucial step in every text-mining pipeline. Neural architectures based on Bidirectional Encoder Representations from Transformers (BERT) have achieved state-of-the-art results in the biomedical domain. In the context of drug discovery and development, clinical trials are necessary to establish the efficacy and safety of drugs. We investigate the effectiveness of transferring concept normalization from the general biomedical domain to the clinical trials domain in a zero-shot setting with an absence of labeled data. We propose a simple and effective two-stage neural approach based on fine-tuned BERT architectures. In the first stage, we train a metric learning model that optimizes relative similarity of mentions and concepts via triplet loss. The model is trained on available labeled corpora of scientific abstracts to obtain vector embeddings of concept names and entity mentions from texts. In the second stage, we find the closest concept name representation in an embedding space to a given clinical mention. We evaluated several models, including state-of-the-art architectures, on a dataset of abstracts and a real-world dataset of trial records with interventions and conditions mapped to drug and disease terminologies. Extensive experiments validate the effectiveness of our approach in knowledge transfer from the scientific literature to clinical trials.
机译:自由形式文本中的概念标准化是每个文本挖掘管道的重要步骤。基于来自变压器(BERT)的双向编码器表示的神经架构已经实现了生物医学领域的最新导致。在药物发现和发展的背景下,临床试验是制定药物的疗效和安全性必需的。我们调查从一般生物医学领域转移概念标准化的有效性,在零射击设置中,没有标记数据。我们提出了一种基于微调伯特架构的简单有效的两级神经方法。在第一阶段,我们训练通过三态损耗优化提升和概念的相对相似性。该模型培训了可用标有科学摘要的Corpora,以获取从文本的概念名称和实体提到的矢量嵌入。在第二阶段,我们发现将嵌入空间中最接近的概念名称表示到给定的临床提及。我们评估了几种模型,包括最先进的艺术品,在抽象的数据集和试验记录的现实世界数据集中,映射到药物和疾病术语的干预措施。广泛的实验验证了我们从科学文学到临床试验的知识转移中的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号