首页> 外文期刊>JMIR Medical Informatics >Next Generation Phenotyping Using the Unified Medical Language System
【24h】

Next Generation Phenotyping Using the Unified Medical Language System

机译:使用统一医学语言系统的下一代表型

获取原文
获取外文期刊封面目录资料

摘要

Background Structured information within patient medical records represents a largely untapped treasure trove of research data. In the United States, privacy issues notwithstanding, this has recently become more accessible thanks to the increasing adoption of electronic health records (EHR) and health care data standards fueled by the Meaningful Use legislation. The other side of the coin is that it is now becoming increasingly more difficult to navigate the profusion of many disparate clinical terminology standards, which often span millions of concepts. Objective The objective of our study was to develop a methodology for integrating large amounts of structured clinical information that is both terminology agnostic and able to capture heterogeneous clinical phenotypes including problems, procedures, medications, and clinical results (such as laboratory tests and clinical observations). In this context, we define phenotyping as the extraction of all clinically relevant features contained in the EHR. Methods The scope of the project was framed by the Common Meaningful Use (MU) Dataset terminology standards; the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT), RxNorm, the Logical Observation Identifiers Names and Codes (LOINC), the Current Procedural Terminology (CPT), the Health care Common Procedure Coding System (HCPCS), the International Classification of Diseases Ninth Revision Clinical Modification (ICD-9-CM), and the International Classification of Diseases Tenth Revision Clinical Modification (ICD-10-CM). The Unified Medical Language System (UMLS) was used as a mapping layer among the MU ontologies. An extract, load, and transform approach separated original annotations in the EHR from the mapping process and allowed for continuous updates as the terminologies were updated. Additionally, we integrated all terminologies into a single UMLS derived ontology and further optimized it to make the relatively large concept graph manageable. Results The initial evaluation was performed with simulated data from the Clinical Avatars project using 100,000 virtual patients undergoing a 90 day, genotype guided, warfarin dosing protocol. This dataset was annotated with standard MU terminologies, loaded, and transformed using the UMLS. We have deployed this methodology to scale in our in-house analytics platform using structured EHR data for 7931 patients (12 million clinical observations) treated at the Froedtert Hospital. A demonstration limited to Clinical Avatars data is available on the Internet using the credentials user “jmirdemo” and password “jmirdemo”. Conclusions Despite its inherent complexity, the UMLS can serve as an effective interface terminology for many of the clinical data standards currently used in the health care domain.
机译:背景患者病历中的结构化信息代表了尚未开发的研究数据宝库。在美国,尽管存在隐私问题,但由于越来越多地采用电子健康记录(EHR)和“有意义使用”立法推动的医疗数据标准,这种情况最近变得更加容易获得。硬币的另一面是,如今越来越难以驾驭许多不同的临床术语标准,这些标准经常涉及数百万个概念。目的我们的研究目的是开发一种方法,用于整合大量结构化的临床信息,这些信息既与术语无关,又能够捕获异类临床表型,包括问题,程序,药物和临床结果(例如实验室检查和临床观察结果) 。在这种情况下,我们将表型定义为EHR中包含的所有临床相关特征的提取。方法该项目的范围由通用有意义使用(MU)数据集术语标准界定;临床医学术语系统化(SNOMED CT),RxNorm,逻辑观察标识符名称和代码(LOINC),当前程序术语(CPT),卫生保健通用程序编码系统(HCPCS),国际疾病分类第九名修订临床修改(ICD-9-CM)和国际疾病分类第十修订临床修改(ICD-10-CM)。统一医学语言系统(UMLS)被用作MU本体之间的映射层。提取,加载和转换方法将EHR中的原始注释与映射过程分开,并允许在术语更新时进行连续更新。此外,我们将所有术语集成到单个UMLS派生的本体中,并对其进行了进一步优化,以使相对较大的概念图易于管理。结果最初的评估是使用来自Clinical Avatars项目的模拟数据进行的,该研究使用了100,000名接受90天,基因型指导的华法林剂量方案的虚拟患者。使用标准MU术语对该数据集进行注释,然后使用UMLS对其进行加载和转换。我们已使用结构化的EHR数据为Froedtert医院治疗的7931名患者(1200万临床观察)部署了此方法,以在我们的内部分析平台中扩展规模。 Internet上使用凭证用户“ jmirdemo”和密码“ jmirdemo”提供了仅限于临床头像数据的演示。结论尽管存在固有的复杂性,但UMLS可以用作当前在医疗保健领域中使用的许多临床数据标准的有效接口术语。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号