首页> 美国卫生研究院文献>BMC Medical Informatics and Decision Making >Care episode retrieval: distributional semantic models for information retrieval in the clinical domain
【2h】

Care episode retrieval: distributional semantic models for information retrieval in the clinical domain

机译:护理发作检索:用于临床领域信息检索的分布式语义模型

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Patients' health related information is stored in electronic health records (EHRs) by health service providers. These records include sequential documentation of care episodes in the form of clinical notes. EHRs are used throughout the health care sector by professionals, administrators and patients, primarily for clinical purposes, but also for secondary purposes such as decision support and research. The vast amounts of information in EHR systems complicate information management and increase the risk of information overload. Therefore, clinicians and researchers need new tools to manage the information stored in the EHRs. A common use case is, given a - possibly unfinished - care episode, to retrieve the most similar care episodes among the records. This paper presents several methods for information retrieval, focusing on care episode retrieval, based on textual similarity, where similarity is measured through domain-specific modelling of the distributional semantics of words. Models include variants of random indexing and the semantic neural network model word2vec. Two novel methods are introduced that utilize the ICD-10 codes attached to care episodes to better induce domain-specificity in the semantic model. We report on experimental evaluation of care episode retrieval that circumvents the lack of human judgements regarding episode relevance. Results suggest that several of the methods proposed outperform a state-of-the art search engine (Lucene) on the retrieval task.
机译:患者的健康相关信息由健康服务提供商存储在电子健康记录(EHR)中。这些记录包括临床记录形式的顺序护理记录。专业人士,管理人员和患者将EHR用于整个医疗保健部门,主要用于临床目的,也可用于辅助目的,例如决策支持和研究。 EHR系统中的大量信息使信息管理复杂化,并增加了信息过载的风险。因此,临床医生和研究人员需要新的工具来管理EHR中存储的信息。给定一个(可能未完成的)护理情节,一个常见的用例是检索记录中最相似的护理情节。本文提出了几种信息检索方法,重点是基于文本相似性的护理情节检索,其中相似性是通过单词的分布语义的特定领域建模来衡量的。模型包括随机索引的变体和语义神经网络模型word2vec。引入了两种新颖的方法,这些方法利用附加到护理事件的ICD-10代码更好地在语义模型中诱导域特异性。我们报告了对护理发作检索的实验评估,该试验规避了缺乏关于发作相关性的人类判断。结果表明,在检索任务方面,所提出的几种方法优于最新的搜索引擎(Lucene)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号