首页> 外文会议>Workshop on Chinese Lexical Semantics >Semantic Representations of Terms in Traditional Chinese Medicine
【24h】

Semantic Representations of Terms in Traditional Chinese Medicine

机译:中医中术语的语义表示

获取原文

摘要

Word embeddings have been widely used in lexical semantics and neural networks in Natural Language Processing. This article investigates the semantic representations using word embedding technologies by verifying them on a human constructed domain ontology. The domain of Traditional Chinese Medicine (TCM) is used as a workbench in this study, because this domain is knowledge-rich and has a large-scale domain ontology with well-defined entity types and relation types. This article releases a dataset, named "TCMSem", to capture TCM domain experts' intuitions of semantic relatedness. This data set is designed to cover the medical entities and relations with as many semantic types as possible so as to initiate a diverse and comprehensive evaluation on word embeddings. Experimental results show that word embeddings have demonstrated higher proficiencies in the detection of synonyms and collocations than other types of semantic relations. Furthermore, the semantic relatedness of thousands of terms of major categories in TCM is visualized using the taxonomy defined in the ontology.
机译:Word Embeddings已被广泛用于自然语言处理中的词汇语义和神经网络。本文通过在人类构建的域本体上验证它们来调查使用Word Embedding Technologies的语义表示。中药(TCM)的领域用作本研究中的工作台,因为该域名是知识丰富的,并且具有具有明确定义的实体类型和关系类型的大规模域本体。本文释放了一个名为“TCMSEM”的数据集,以捕获TCM域专家的语义相关性。该数据集旨在涵盖与尽可能多的语义类型的医学实体和关系,以便在Word Embeddings上发起多样化和综合评估。实验结果表明,嵌入词在检测到比其他类型的语义关系的同义词和搭配中展示了更高的突发性。此外,使用本体中定义的分类学,TCM中成千上万的主要类别的语义相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号