...
首页> 外文期刊>Journal of the American Medical Informatics Association : >Impact of data fragmentation across healthcare centers on the accuracy of a high-throughput clinical phenotyping algorithm for specifying subjects with type 2 diabetes mellitus
【24h】

Impact of data fragmentation across healthcare centers on the accuracy of a high-throughput clinical phenotyping algorithm for specifying subjects with type 2 diabetes mellitus

机译:跨医疗中心的数据碎片化对高通量临床表型算法准确性的影响,该算法用于指定患有2型糖尿病的受试者

获取原文
获取原文并翻译 | 示例

摘要

Objective: To evaluate data fragmentation across healthcare centers with regard to the accuracy of a highthroughput clinical phenotyping (HTCP) algorithm developed to differentiate (1) patients with type 2 diabetes mellitus (T2DM) and (2) patients with no diabetes. Materials and methods: This population-based study identified all Olmsted County, Minnesota residents in 2007. We used provider-linked electronic medical record data from the two healthcare centers that provide >95% of all care to County residents (ie, Olmsted Medical Center and Mayo Clinic in Rochester, Minnesota, USA). Subjects were limited to residents with one or more encounter January 1, 2006 through December 31, 2007 at both healthcare centers. DM-relevant data on diagnoses, laboratory results, and medication from both centers were obtained during this period. The algorithm was first executed using data from both centers (ie, the gold standard) and then from Mayo Clinic alone. Positive predictive values and false-negative rates were calculated, and the McNemar test was used to compare categorization when data from the Mayo Clinic alone were used with the gold standard. Age and sex were compared between true-positive and false-negative subjects with T2DM. Statistical significance was accepted as p<0.05. Results: With data from both medical centers, 765 subjects with T2DM (4256 non-DM subjects) were identified. When single-center data were used, 252 T2DM subjects (1573 non-DM subjects) were missed; an additional false-positive 27 T2DM subjects (215 non-DM subjects) were identified. The positive predictive values and false-negative rates were 95.0% (513/540) and 32.9% (252/765), respectively, for T2DM subjects and 92.6% (2683/2898) and 37.0% (1573/4256), respectively, for non-DM subjects. Age and sex distribution differed between true-positive (mean age 62.1; 45% female) and false-negative (mean age 65.0; 56.0% female) T2DM subjects. Conclusion: The findings show that application of an HTCP algorithm using data from a single medical center contributes to misclassification. These findings should be considered carefully by researchers when developing and executing HTCP algorithms.
机译:目的:就高通量临床表型(HTCP)算法的准确性评估整个医疗中心的数据碎片,该算法可区分(1)2型糖尿病(T2DM)患者和(2)无糖尿病患者。资料和方法:这项基于人群的研究确定了2007年明尼苏达州奥姆斯特德县的所有居民。我们使用了来自两个医疗中心的提供者链接的电子病历数据,这两个医疗中心为县居民提供了超过95%的所有护理(即,奥姆斯特德医学中心美国明尼苏达州罗彻斯特市的梅奥诊所(Mayo Clinic)。受试者仅限于2006年1月1日至2007年12月31日在两个医疗中心遇到一次或多次接触的居民。在此期间,从这两个中心获得了与DM相关的诊断,实验室结果和用药方面的数据。该算法首先使用两个中心的数据(即黄金标准)执行,然后再使用Mayo Clinic的数据执行。计算了阳性预测值和假阴性率,当仅将梅奥诊所的数据与黄金标准一起使用时,使用McNemar检验比较分类。比较了患有T2DM的真阳性和假阴性受试者的年龄和性别。统计学显着性被接受为p <0.05。结果:利用来自两个医疗中心的数据,共鉴定了765名T2DM受试者(4256个非DM受试者)。使用单中心数据时,错过了252个T2DM受试者(1573个非DM受试者);确定了另外的假阳性27名T2DM受试者(215名非DM受试者)。对于T2DM受试者,阳性预测值和假阴性率分别为95.0%(513/540)和32.9%(252/765),分别为92.6%(2683/2898)和37.0%(1573/4256),对于非DM主题。 T2DM受试者的真阳性(平均年龄62.1;女性45%)和假阴性(平均年龄65.0;女性56.0%)之间的年龄和性别分布有所不同。结论:研究结果表明,使用来自单个医疗中心的数据的HTCP算法的应用会导致分类错误。研究人员在开发和执行HTCP算法时应仔细考虑这些发现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号