...
首页> 外文期刊>BMJ Open >Data linkage errors in hospital administrative data when applying a pseudonymisation algorithm to paediatric intensive care records
【24h】

Data linkage errors in hospital administrative data when applying a pseudonymisation algorithm to paediatric intensive care records

机译:将假名化算法应用于儿科重症监护记录时,医院管理数据中的数据链接错误

获取原文
           

摘要

Objectives Our aim was to estimate the rate of data linkage error in Hospital Episode Statistics (HES) by testing the HESID pseudoanonymisation algorithm against a reference standard, in a national registry of paediatric intensive care records. Setting The Paediatric Intensive Care Audit Network (PICANet) database, covering 33 paediatric intensive care units in England, Scotland and Wales. Participants Data from infants and young people aged 0–19?years admitted between 1 January 2004 and 21 February 2014. Primary and secondary outcome measures PICANet admission records were classified as matches (records belonging to the same patient who had been readmitted) or non-matches (records belonging to different patients) after applying the HESID algorithm to PICANet records. False-match and missed-match rates were calculated by comparing results of the HESID algorithm with the reference standard PICANet ID. The effect of linkage errors on readmission rate was evaluated. Results Of 166?406 admissions, 88?596 were true matches (where the same patient had been readmitted). The HESID pseudonymisation algorithm produced few false matches (n=176/77?810; 0.2%) but a larger proportion of missed matches (n=3609/88?596; 4.1%). The true readmission rate was underestimated by 3.8% due to linkage errors. Patients who were younger, male, from Asian/Black/Other ethnic groups (vs White) were more likely to experience a false match. Missed matches were more common for younger patients, for Asian/Black/Other ethnic groups (vs White) and for patients whose records had missing data. Conclusions The deterministic algorithm used to link all episodes of hospital care for the same patient in England has a high missed match rate which underestimates the true readmission rate and will produce biased analyses. To reduce linkage error, pseudoanonymisation algorithms need to be validated against good quality reference standards. Pseudonymisation of data ‘at source’ does not itself address errors in patient identifiers and the impact these errors have on data linkage.
机译:目的我们的目的是通过在国家儿科重症监护记录中对照参考标准测试HESID伪匿名算法来估计医院情节统计(HES)中数据链接错误的发生率。设置儿科重症监护审核网络(PICANet)数据库,涵盖英格兰,苏格兰和威尔士的33个儿科重症监护病房。参与者来自2004年1月1日至2014年2月21日之间的0-19岁婴幼儿数据。主要和次要指标PICANet入院记录分为匹配项(属于已再次入院的同一患者的记录)或非匹配项。在将HESID算法应用于PICANet记录后进行匹配(属于不同患者的记录)。通过将HESID算法的结果与参考标准PICANet ID进行比较,可以计算出不匹配率和不匹配率。评估了连锁错误对再入院率的影响。结果166〜406例入院病例中,88〜596例为真匹配(同一患者已被重新入院)。 HESID假名化算法产生的错误匹配很少(n = 176/77?810; 0.2%),但是丢失匹配的比例更大(n = 3609/88?596; 4.1%)。由于链接错误,真实的重新录取率被低估了3.8%。来自亚洲/黑人/其他族裔(相对于白人)的年轻男性患者更容易出现假匹配。对于年轻的患者,亚洲/黑人/其他族裔群体(与白人)以及记录中缺少数据的患者,错过比赛更为常见。结论用于将英格兰同一位患者的所有医院护理事件联系起来的确定性算法具有很高的漏选率,这低估了真实的再入院率,并且会产生偏倚的分析。为了减少链接错误,伪匿名算法需要针对高质量的参考标准进行验证。 “从源头”获取数据的假名本身并不能解决患者标识符中的错误以及这些错误对数据链接的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号