...
首页> 外文期刊>Nature protocols erecipes for researchers >High-throughput phenotyping with electronic medical record data using a common semi-supervised approach (PheCAP)
【24h】

High-throughput phenotyping with electronic medical record data using a common semi-supervised approach (PheCAP)

机译:使用常见的半监督方法(PHECAP)与电子医疗记录数据的高吞吐量表型

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Phenotypes are the foundation for clinical and genetic studies of disease risk and outcomes. The growth of biobanks linked to electronic medical record (EMR) data has both facilitated and increased the demand for efficient, accurate, and robust approaches for phenotyping millions of patients. Challenges to phenotyping with EMR data include variation in the accuracy of codes, as well as the high level of manual input required to identify features for the algorithm and to obtain gold standard labels. To address these challenges, we developed PheCAP, a high-throughput semi-supervised phenotyping pipeline. PheCAP begins with data from the EMR, including structured data and information extracted from the narrative notes using natural language processing (NLP). The standardized steps integrate automated procedures, which reduce the level of manual input, and machine learning approaches for algorithm training. PheCAP itself can be executed in 1-2 d if all data are available; however, the timing is largely dependent on the chart review stage, which typically requires at least 2 weeks. The final products of PheCAP include a phenotype algorithm, the probability of the phenotype for all patients, and a phenotype classification (yes or no).
机译:表型是疾病风险和结果的临床和遗传研究的基础。与电子医疗记录(EMR)数据相关的BioBanks的生长既促进,并增加了对数百万患者的高效,准确,鲁棒方法的需求。对EMR数据的表型挑战包括代码精度的变化,以及识别算法的特征所需的高水平手动输入,并获得金标标签。为解决这些挑战,我们开发了Phecap,一个高吞吐量半监督的表型管道。 PHECAP以EMR的数据开头,包括使用自然语言处理(NLP)从叙述笔记中提取的结构化数据和信息。标准化步骤集成了自动化程序,减少了手动输入的水平,以及用于算法训练的机器学习方法。如果所有数据都可用,则可以在1-2天中执行PHECAP本身;但是,时序在很大程度上取决于图表审查阶段,通常需要至少2周。 PHECAP的最终产品包括表型算法,所有患者的表型的概率,以及表型分类(是或否)。

著录项

  • 来源
  • 作者单位

    Harvard TH Chan Sch Publ Hlth Dept Biostat Boston MA USA;

    Brigham &

    Womens Hosp Div Rheumatol Immunol &

    Allergy 75 Francis St Boston MA 02115 USA;

    Tsinghua Univ Ctr Stat Sci Beijing Peoples R China;

    VA Boston Healthcare Syst Div Data Sci Boston MA 02130 USA;

    Harvard TH Chan Sch Publ Hlth Dept Biostat Boston MA USA;

    Harvard TH Chan Sch Publ Hlth Dept Biostat Boston MA USA;

    Brigham &

    Womens Hosp Div Rheumatol Immunol &

    Allergy 75 Francis St Boston MA 02115 USA;

    VA Boston Healthcare Syst Div Data Sci Boston MA 02130 USA;

    Massachusetts Gen Hosp Dept Gastroenterol Boston MA 02114 USA;

    Univ Pittsburgh Dept Neurol Pittsburgh PA 15260 USA;

    Brigham &

    Womens Hosp Div Cardiovasc Med 75 Francis St Boston MA 02115 USA;

    Partners Healthcare Res Informat Sci &

    Comp Boston MA USA;

    Partners Healthcare Res Informat Sci &

    Comp Boston MA USA;

    VA Boston Healthcare Syst Div Data Sci Boston MA 02130 USA;

    VA Boston Healthcare Syst Div Data Sci Boston MA 02130 USA;

    Brigham &

    Womens Hosp Div Rheumatol Immunol &

    Allergy 75 Francis St Boston MA 02115 USA;

    VA Boston Healthcare Syst Div Data Sci Boston MA 02130 USA;

    Brigham &

    Womens Hosp Div Rheumatol Immunol &

    Allergy 75 Francis St Boston MA 02115 USA;

    Brigham &

    Womens Hosp Div Rheumatol Immunol &

    Allergy 75 Francis St Boston MA 02115 USA;

    MIT Dept Elect Engn &

    Comp Sci Cambridge MA 02139 USA;

    Boston Childrens Hosp Computat Hlth Informat Program Boston MA USA;

    Harvard Med Sch Dept Biomed Informat Boston MA 02115 USA;

    VA Boston Healthcare Syst Div Data Sci Boston MA 02130 USA;

    Partners Healthcare Res Informat Sci &

    Comp Boston MA USA;

    VA Boston Healthcare Syst Div Data Sci Boston MA 02130 USA;

    Harvard TH Chan Sch Publ Hlth Dept Biostat Boston MA USA;

    Brigham &

    Womens Hosp Div Rheumatol Immunol &

    Allergy 75 Francis St Boston MA 02115 USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 生物科学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号