首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >Topic medical concept embedding: Multi-sense representation learning for medical concept
【24h】

Topic medical concept embedding: Multi-sense representation learning for medical concept

机译:主题医学概念嵌入:医学概念的多义表示学习

获取原文

摘要

Representation learning algorithm in medical area maps high dimensional real world medical concepts to low dimensional vector space, encodes rich medical knowledge, and has brought improvement to various machine learning applications in medical area. However, previous representation learning models in medical area failed to consider the multi-sense characteristic of medical concept. Moreover, the inner relationships between representations learned by previous model is implicit and can only be explained according to visualization, which means poor interpretability. In this paper, we propose Topic Medical Concept Embedding (TMCE), a generative embedding model to address above two problems. TMCE is able to learn multi-sense representations for a single medical concept, and TMCE can also improve interpretability by modeling relationships between each concept explicitly. In TMCE, multi-sense concept representations are influenced by its contexts and its topics. In addition, dosage information which is ignored by previous work are also utilized in TMCE. A MCMC method is presented to jointly learn the two-layer topic embeddings and multi-sense concept embeddings. Experimental results show that representations learned by TMCE outperforms those learned by other strong baselines by a large margin in a multi-label diagnose classification tasks. Several case studies further show that TMCE can learn medically correct multi-sense representations with better interpretability than other strong baselines.
机译:医学领域中的表示学习算法将高维现实世界医学概念映射到低维向量空间,对丰富的医学知识进行编码,并为医学领域中的各种机器学习应用带来了改进。但是,以往医学领域的表征学习模型未能考虑医学概念的多重感觉特征。此外,先前模型学习的表示之间的内部关系是隐式的,只能根据可视化进行解释,这意味着可解释性差。在本文中,我们提出了主题医学概念嵌入(TMCE),这是一种用于解决上述两个问题的生成性嵌入模型。 TMCE能够学习单个医学概念的多义表示,并且TMCE还可以通过显式地建模每个概念之间的关系来提高可解释性。在TMCE中,多义概念表示受其上下文和主题影响。另外,在TMCE中还利用了先前工作所忽略的剂量信息。提出了一种MCMC方法,用于联合学习两层主题嵌入和多感觉概念嵌入。实验结果表明,在多标签诊断分类任务中,TMCE所学习的表示形式比其他强基线所学习的表示形式要大得多。若干案例研究进一步表明,TMCE可以比其他强有力的基线更好地理解医学上正确的多义表示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号