首页> 外文会议>Canadian conference on artificial intelligence >Unsupervised Extraction of Diagnosis Codes from EMRs Using Knowledge-Based and Extractive Text Summarization Techniques

【24h】

Unsupervised Extraction of Diagnosis Codes from EMRs Using Knowledge-Based and Extractive Text Summarization Techniques

机译：使用基于知识的和提取性文本摘要技术从EMR中无监督地提取诊断代码

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Diagnosis codes are extracted from medical records for billing and reimbursement and for secondary uses such as quality control and cohort identification. In the US, these codes come from the standard terminology ICD-9-CM derived from the international classification of diseases (ICD). ICD-9 codes are generally extracted by trained human coders by reading all artifacts available in a patient's medical record following specific coding guidelines. To assist coders in this manual process, this paper proposes an unsupervised ensemble approach to automatically extract ICD-9 diagnosis codes from textual narratives included in electronic medical records (EMRs). Earlier attempts on automatic extraction focused on individual documents such as radiology reports and discharge summaries. Here we use a more realistic dataset and extract ICD-9 codes from EMRs of 1000 inpatient visits at the University of Kentucky Medical Center. Using named entity recognition (NER), graph-based concept-mapping of medical concepts, and extractive text summarization techniques, we achieve an example based average recall of 0.42 with average precision 0.47; compared with a baseline of using only NER, we notice a 12% improvement in recall with the graph-based approach and a 7% improvement in precision using the extractive text summarization approach. Although diagnosis codes are complex concepts often expressed in text with significant long range non-local dependencies, our present work shows the potential of unsupervised methods in extracting a portion of codes. As such, our findings are especially relevant for code extraction tasks where obtaining large amounts of training data is difficult.

机译：从医疗记录中提取诊断代码以进行计费和报销，以及用于诸如质量控制和队列识别之类的辅助用途。在美国，这些代码来自于国际疾病分类（ICD）的标准术语ICD-9-CM。 ICD-9代码通常由受过训练的人类编码人员按照特定的编码指南，通过阅读患者病历中所有可用的工件来提取。为了帮助编码人员进行手动操作，本文提出了一种无监督的集成方法，可以从电子病历（EMR）中包含的文本叙述中自动提取ICD-9诊断代码。较早的自动提取尝试集中于单个文档，例如放射学报告和出院摘要。在这里，我们使用了更现实的数据集，并从肯塔基大学医学中心的1000例住院就诊的EMR中提取了ICD-9代码。使用命名实体识别（NER），基于图形的医学概念映射和提取文本摘要技术，我们实现了基于示例的平均召回率0.42和平均精度0.47;与仅使用NER的基线相比，我们注意到基于图的方法的查全率提高了12％，而使用提取文本摘要方法的查全率则提高了7％。尽管诊断代码是通常在文本中表达的复杂概念，并且具有很长的远程非本地依赖性，但我们目前的工作显示了在提取一部分代码时无监督方法的潜力。因此，我们的发现特别适用于难以获取大量训练数据的代码提取任务。

著录项

来源
《Canadian conference on artificial intelligence》|2013年|77-88|共12页
会议地点
作者
Ramakanth Kavuluru; Sifei Han; Daniel Harris;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
入库时间 2022-08-26 15:15:00

相似文献

外文文献
中文文献
专利

1. SummCoder: An unsupervised framework for extractive text summarization based on deep auto-encoders [J] . Joshi Akanksha, Fidalgo E., Alegre E., Expert Systems with Application . 2019,第SEPa期

机译：SummCoder：基于深度自动编码器的用于抽取文本摘要的无监督框架
2. SummCoder: An unsupervised framework for extractive text summarization based on deep auto-encoders [J] . Joshi Akanksha, Fidalgo E., Alegre E., Expert systems with applications . 2019,第Sepa期

机译：SimmoDer：基于深度自动编码器的提取文本摘要的无监督框架
3. Evaluation of Unsupervised Learning based Extractive Text Summarization Technique for Large Scale Review and Feedback Data [J] . Jai Prakash Verma, Atul Patel Indian Journal of Science and Technology . 2017,第17期

机译：基于大规模学习和反馈数据的基于无监督学习的提取文本摘要技术的评估
4. Unsupervised Extraction of Diagnosis Codes from EMRs Using Knowledge-Based and Extractive Text Summarization Techniques [C] . Ramakanth Kavuluru, Sifei Han, Daniel Harris Canadian conference on artificial intelligence . 2013

机译：使用基于知识和提取文本摘要技术的EMRS从EMRS的诊断代码提取
5. A Hierarchical Extractive Text Summarization Approach [D] . Alshahrani, Saud Shari. 2021

机译：分层提取文本摘要方法
6. Unsupervised Extraction of Diagnosis Codes from EMRs Using Knowledge-Based and Extractive Text Summarization Techniques [O] . Ramakanth Kavuluru, Sifei Han, Daniel Harris -1

机译：使用基于知识的和提取文本摘要技术从EMR中无监督地提取诊断代码
7. Text Summarization by Sentence Extraction Using Unsupervised Learning [O] . René Arnulfo García-hernández, Romyna Montiel, Yulia Ledeneva, 2008

机译：使用无监督学习的句子提取进行文本摘要

Unsupervised Extraction of Diagnosis Codes from EMRs Using Knowledge-Based and Extractive Text Summarization Techniques

摘要

著录项

相似文献

相关主题

期刊订阅