Conference: Federated Conference on Computer Science and Information Systems

Automatic Generation of Annotated Corpora of Diagnoses with ICD-10 codes based on Open Data and Linked Open Data

Abstract

We propose methods for the automatic generation of corpora that contain descriptions of diagnoses in Bulgarian and their associated codes in ICD-10-CM (International Classification of Diseases, 10th Revision, Clinical Modification). The proposed approach is based on available open data and Linked Open Data and can be easily adapted to other languages. The resulting corpus for Bulgarian clinical texts consists of about 370,000 pairs of diagnoses and corresponding ICD-10 codes, which is beyond the size that can usually be produced manually; moreover, it was created from scratch and in a relatively short time. The corpus can be updated further whenever new open resources become available or the current ones are updated.