首页> 外文会议>International Workshop on Health Text Mining and Information Analysis >Medical Concept Normalization in User-Generated Texts by Learning Target Concept Embeddings
【24h】

Medical Concept Normalization in User-Generated Texts by Learning Target Concept Embeddings

机译:通过学习目标概念嵌入用户生成的文本中的医学概念标准化

获取原文

摘要

Medical concept normalization helps in discovering standard concepts in free-form text i.e., maps health-related mentions to standard concepts in a clinical knowledge base. It is much beyond simple string matching and requires a deep semantic understanding of concept mentions. Recent research approach concept normalization as either text classification or text similarity. The main drawback in existing a) text classification approach is ignoring valuable target concepts information in learning input concept mention representation b) text similarity approach is the need to separately generate target concept embeddings which is time and resource consuming. Our proposed model overcomes these drawbacks by jointly learning the representations of input concept mention and target concepts. First, we learn input concept mention representation using RoBERTa. Second, we find cosine similarity between embeddings of input concept mention and all the target concepts. Here, embeddings of target concepts are randomly initialized and then updated during training. Finally, the target concept with maximum cosine similarity is assigned to the input concept mention. Our model surpasses all the existing methods across three standard datasets by improving accuracy up to 2.31%.
机译:医学概念标准化有助于在自由形式文本中发现标准概念,即地图与临床知识库中的标准概念进行地图。它超出了简单的字符串匹配,需要深入的语义理解概念提到。最近的研究方法概念标准化为文本分类或文本相似性。现有A)文本分类方法的主要缺点是忽略了学习输入概念的有价值的目标概念信息提到表示B)文本相似性方法是需要单独生成目标概念嵌入的时间和资源消耗。我们拟议的模式通过联合学习输入概念提及和目标概念的表示来克服了这些缺点。首先,我们学习使用Roberta提及表示的输入概念。其次,我们发现输入概念的嵌入和所有目标概念之间的嵌入之间的余弦相似之处。在这里,目标概念的嵌入是随机初始化的,然后在训练期间更新。最后,将具有最大余弦相似性的目标概念分配给输入概念提及。我们的模型通过提高高度为2.31%,超越三个标准数据集的所有现有方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号