首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >An automatic approach for constructing a knowledge base of symptoms in Chinese
【24h】

An automatic approach for constructing a knowledge base of symptoms in Chinese

机译:一种自动构建中文症状知识库的方法

获取原文

摘要

While a large number of well-known knowledge bases (KBs) in life science have been published as Linked Open Data, there are few KBs in Chinese. However, KBs of life science in Chinese are necessary when we want to automatically process and analyze electronic medical records (EMRs) in Chinese. Of all, the symptom KB in Chinese is the most seriously in need, since symptoms are the starting point of clinical diagnosis. Furthermore, expressions used in describing symptoms in clinical practice are diverse, which makes it hard to collect such a KB. In this paper, we publish a public KB of symptoms in Chinese. The KB is constructed by fusing data automatically extracted from eight mainstream healthcare websites, three Chinese encyclopedia sites, and symptoms extracted from a large number of EMRs as supplements. As a result, the KB has more than 26,000 distinct symptoms in Chinese including 3,968 symptoms in traditional Chinese medicine (TCM) and 1,029 synonym pairs for symptoms. The KB also includes concepts such as diseases and medicines as well as relations between symptoms and the above related entities. We also link our KB to the Unified Medical Language System (UMLS) and analyze the differences between symptoms in the two KBs. We released the KB as Linked Open Data and a demo at https://datahub.io/dataset/symptoms-in-chinese.
机译:虽然许多生命科学领域的著名知识库(KB)已作为链接开放数据发布,但中文的KB很少。但是,当我们要自动处理和分析中文电子病历(EMR)时,必须使用中文生命科学知识库。最重要的是,中文症状KB是最需要的,因为症状是临床诊断的起点。此外,在临床实践中用于描述症状的表达是多种多样的,这使得很难收集这样的KB。在本文中,我们发布了中文症状的公共知识库。知识库是通过融合从8个主流医疗保健网站,3个中国百科全书网站自动提取的数据以及从大量EMR中提取的症状作为补充而构建的。结果,知识库在中文中有26,000多种不同的症状,包括3,968种中药(TCM)症状和1,029种症状的同义词对。知识库还包括疾病和药物等概念,以及症状与上述相关实体之间的关系。我们还将知识库链接到统一医学语言系统(UMLS),并分析两个知识库中症状之间的差异。我们在https://datahub.io/dataset/symptoms-in-chinese中发布了KB作为链接的开放数据和一个演示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号