...
首页> 外文期刊>PLoS One >COS: A new MeSH term embedding incorporating corpus, ontology, and semantic predications
【24h】

COS: A new MeSH term embedding incorporating corpus, ontology, and semantic predications

机译:COS:一种新的网格术语嵌入包含语料库,本体和语义预测

获取原文
   

获取外文期刊封面封底 >>

       

摘要

The embedding of Medical Subject Headings (MeSH) terms has become a foundation for many downstream bioinformatics tasks. Recent studies employ different data sources, such as the corpus (in which each document is indexed by a set of MeSH terms), the MeSH term ontology, and the semantic predications between MeSH terms (extracted by SemMedDB), to learn their embeddings. While these data sources contribute to learning the MeSH term embeddings, current approaches fail to incorporate all of them in the learning process. The challenge is that the structured relationships between MeSH terms are different across the data sources, and there is no approach to fusing such complex data into the MeSH term embedding learning. In this paper, we study the problem of incorporating corpus, ontology, and semantic predications to learn the embeddings of MeSH terms. We propose a novel framework, Corpus, Ontology, and Semantic predications-based MeSH term embedding (COS), to generate high-quality MeSH term embeddings. COS converts the corpus, ontology, and semantic predications into MeSH term sequences, merges these sequences, and learns MeSH term embeddings using the sequences. Extensive experiments on different datasets show that COS outperforms various baseline embeddings and traditional non-embedding-based baselines.
机译:医疗主题标题(网格)术语的嵌入已成为许多下游生物信息学的基础。最近的研究采用不同的数据源,例如语料库(每个文档由一组网格术语索引),网格术语本体和网格术语之间的语义预测(由SemMeddB提取),以学习其嵌入。虽然这些数据源有助于学习网格术语嵌入,但目前的方法无法将所有方法纳入学习过程中。挑战是网格术语之间的结构化关系在数据源上是不同的,并且不存在将这些复杂数据融合到网格术语嵌入学习中的方法。在本文中,我们研究了掺入语料库,本体论和语义预测的问题,以学习网格术语的嵌入。我们提出了一种新颖的框架,语料库,本体论和语义预测的基于语义嵌入(COS),以产生高质量的网格术语嵌入。 COS将语料库,本体和语义预测转换为网格术语序列,合并这些序列,并使用序列学习网格术语eMbeddings。对不同数据集的广泛实验表明COS优于各种基线嵌入和传统的非嵌入基基线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号