首页> 外文期刊>JMIR Medical Informatics >Implementation of a Cohort Retrieval System for Clinical Data Repositories Using the Observational Medical Outcomes Partnership Common Data Model: Proof-of-Concept System Validation
【24h】

Implementation of a Cohort Retrieval System for Clinical Data Repositories Using the Observational Medical Outcomes Partnership Common Data Model: Proof-of-Concept System Validation

机译:使用观察医疗结果合作数据模型的临床数据存储库的临床数据存储库的实施:概念证明系统验证

获取原文
           

摘要

Background Widespread adoption of electronic health records has enabled the secondary use of electronic health record data for clinical research and health care delivery. Natural language processing techniques have shown promise in their capability to extract the information embedded in unstructured clinical data, and information retrieval techniques provide flexible and scalable solutions that can augment natural language processing systems for retrieving and ranking relevant records. Objective In this paper, we present the implementation of a cohort retrieval system that can execute textual cohort selection queries on both structured data and unstructured text—Cohort Retrieval Enhanced by Analysis of Text from Electronic Health Records (CREATE). Methods CREATE is a proof-of-concept system that leverages a combination of structured queries and information retrieval techniques on natural language processing results to improve cohort retrieval performance using the Observational Medical Outcomes Partnership Common Data Model to enhance model portability. The natural language processing component was used to extract common data model concepts from textual queries. We designed a hierarchical index to support the common data model concept search utilizing information retrieval techniques and frameworks. Results Our case study on 5 cohort identification queries, evaluated using the precision at 5 information retrieval metric at both the patient-level and document-level, demonstrates that CREATE achieves a mean precision at 5 of 0.90, which outperforms systems using only structured data or only unstructured text with mean precision at 5 values of 0.54 and 0.74, respectively. Conclusions The implementation and evaluation of Mayo Clinic Biobank data demonstrated that CREATE outperforms cohort retrieval systems that only use one of either structured data or unstructured text in complex textual cohort queries.
机译:背景技术通过电子健康记录的广泛采用使电子健康记录数据的二次使用进行临床研究和医疗保健。自然语言处理技术已经显示在其能力中提取嵌入在非结构化临床数据中的信息的能力,信息检索技术提供了灵活且可扩展的解决方案,可以增加用于检索和排序相关记录的自然语言处理系统。目的在本文中,我们介绍了队列检索系统的实现,可以通过从电子健康记录(Create)的文本分析来执行关于结构化数据和非结构化文本队队队员的文本队列选择查询的队列检索系统。方法创建是一个概念验证系统,它利用结构化查询和信息检索技术的组合,用于使用伙伴关系普通数据模型来改善群组检索性能,以提高模型便携性。自然语言处理组件用于从文本查询中提取常见的数据模型概念。我们设计了一个分层索引,以支持使用信息检索技术和框架的公共数据模型概念搜索。结果我们对5个队列识别查询的案例研究,在患者级和文档级别的5个信息检索度量下使用精度评估,表明,创建在0.90的5/0的平均精度下实现,其仅使用结构化数据或只有非结构化文本,平均精度为5值,分别为0.54和0.74。结论Mayo诊所BioBank数据的实施和评估表明,创建胜过伙计检索系统,只能在复杂的文本队列查询中使用结构化数据或非结构化文本中的一个。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号