首页> 外文会议>Annual International Conference of the IEEE Engineering in Medicine and Biology Society >UMLS mapping and Word embeddings for ICD code assignment using the MIMIC-III intensive care database
【24h】

UMLS mapping and Word embeddings for ICD code assignment using the MIMIC-III intensive care database

机译:使用MIMIC-III重症监护数据库的ICD代码分配UMLS映射和Word Embeddings

获取原文

摘要

Diagnosis codes are used as a billing mechanism in the Electronic Health Record and have the capability to benefit decision support systems, which aim to assist coders by suggesting a relevant subset of potential codes to choose from. Due to the large set of possible labels and length of patient records, automatic ICD code assignment is considered to be a challenging task within the field of multi-label classification. This paper introduces a baseline for automatic ICD code assignment using Support Vector Machines (SVM) and FastText with Unified Medical Language System (UMLS) metathesaurus mappings into word embedding models. Training data is obtained from the Medical Information Mart for Intensive Care (MIMIC-III) database and extended with ’is-a’ relationships from ICD-9 hierarchy. FastText is evaluated with different label count estimations, of which an approach based on label cardinality yields a F1-Score of 62.2%. FastText achieves high recall results and mentionable performance improvements over previous models. Reported values are obtained through 10-fold cross-validation.
机译:诊断代码用作电子健康记录中的计费机制,并具有受益决策支持系统的能力,该系统旨在通过建议选择相关潜在代码的相关子集。由于患者记录的一系列可能的标签和长度,自动ICD代码分配被认为是多标签分类领域内的具有挑战性的任务。本文介绍了使用支持向量机(SVM)和FastText的自动ICD代码分配的基线,并具有统一的医疗语言系统(UMLS)Metathesaurus映射到Word嵌入模型中。培训数据是从医疗信息MART获取的重症监护(MIMIC-III)数据库,并与ICD-9层次结构的关系延伸。 FastText进行了不同的标签计数估计,其中基于标签基数的方法产生F1分数为62.2%。 FastText实现了高召回结果,并在以前的型号上得到了增强的性能改进。报告的值是通过10倍交叉验证获得的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号