首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Query-Focused EHR Summarization to Aid Imaging Diagnosis
【24h】

Query-Focused EHR Summarization to Aid Imaging Diagnosis

机译:侧重于查询的EHR摘要,以援助成像诊断

获取原文
           

摘要

Electronic Health Records (EHRs) provide vital contextual information to radiologists and other physicians when making a diagnosis. Unfortunately, because a given patient’s record may contain hundreds of notes and reports, identifying relevant information within these in the short time typically allotted to a case is very difficult. We propose and evaluate models that extract relevant text snippets from patient records to provide a rough case summary intended to aid physicians considering one or more diagnoses. This is hard because direct supervision (i.e., physician annotations of snippets relevant to specific diagnoses in medical records) is prohibitively expensive to collect at scale. We propose a distantly supervised strategy in which we use groups of International Classification of Diseases (ICD) codes observed in ‘future’ records as noisy proxies for ‘downstream’ diagnoses. Using this we train a transformer-based neural model to perform extractive summarization conditioned on potential diagnoses. This model defines an attention mechanism that is conditioned on potential diagnoses (queries) provided by the diagnosing physician. We train (via distant supervision) and evaluate variants of this model on EHR data from Brigham and Women’s Hospital in Boston and MIMIC-III (the latter to facilitate reproducibility). Evaluations performed by radiologists demonstrate that these distantly supervised models yield better extractive summaries than do unsupervised approaches. Such models may aid diagnosis by identifying sentences in past patient reports that are clinically relevant to a potential diagnosis. Code is available at https://github.com/dmcinerney/ehr-extraction-models.
机译:电子健康记录(EHRS)在进行诊断时为放射科医生和其他医生提供重要的上下文信息。不幸的是,由于给定的患者的记录可能包含数百个音符和报告,因此在通常分配给案例的短时间内识别这些内部的相关信息非常困难。我们提出并评估从患者记录中提取相关文本片段的模型,以提供令人粗略的案例摘要,以帮助考虑一个或多个诊断的医生。这很难因为直接监督(即,医疗记录中特定诊断的片段的医师注释)在规模上收集昂贵。我们提出了一种远端监督策略,其中我们使用在“未来”记录中观察到的疾病(ICD)代码的国际分类群体作为“下游”诊断的嘈杂代理。使用此我们培训基于变压器的神经模型,以进行潜在诊断的调节概括。该模型定义了诊断医师提供的潜在诊断(查询)的关注机制。我们培训(通过遥远的监督),并在波士顿和MIMIC-III的Brigham和女子医院评估此模型的变体和MIMIC-III(后者以促进再现性)。放射科医生进行的评估表明,这些远方监督模型比无监督的方法产生更好的提取摘要。这种模型可以通过识别与潜在诊断相关的过去相关的患者报告中的句子来诊断。代码可在https://github.com/dmcinerney/ehr-extraction-models获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号