Data linkage errors in hospital administrative data when applying a pseudonymisation algorithm to paediatric intensive care records

Gareth Hagger-Johnson; Harvey Goldstein; Katie Harron; Rebecca Landy; Roger C Parslow; Ruth Gilbert; Tom Fleming

首页> 外文期刊>BMJ Open >Data linkage errors in hospital administrative data when applying a pseudonymisation algorithm to paediatric intensive care records

【24h】

Data linkage errors in hospital administrative data when applying a pseudonymisation algorithm to paediatric intensive care records

机译：将假名化算法应用于儿科重症监护记录时，医院管理数据中的数据链接错误

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Objectives Our aim was to estimate the rate of data linkage error in Hospital Episode Statistics (HES) by testing the HESID pseudoanonymisation algorithm against a reference standard, in a national registry of paediatric intensive care records. Setting The Paediatric Intensive Care Audit Network (PICANet) database, covering 33 paediatric intensive care units in England, Scotland and Wales. Participants Data from infants and young people aged 0–19?years admitted between 1 January 2004 and 21 February 2014. Primary and secondary outcome measures PICANet admission records were classified as matches (records belonging to the same patient who had been readmitted) or non-matches (records belonging to different patients) after applying the HESID algorithm to PICANet records. False-match and missed-match rates were calculated by comparing results of the HESID algorithm with the reference standard PICANet ID. The effect of linkage errors on readmission rate was evaluated. Results Of 166?406 admissions, 88?596 were true matches (where the same patient had been readmitted). The HESID pseudonymisation algorithm produced few false matches (n=176/77?810; 0.2%) but a larger proportion of missed matches (n=3609/88?596; 4.1%). The true readmission rate was underestimated by 3.8% due to linkage errors. Patients who were younger, male, from Asian/Black/Other ethnic groups (vs White) were more likely to experience a false match. Missed matches were more common for younger patients, for Asian/Black/Other ethnic groups (vs White) and for patients whose records had missing data. Conclusions The deterministic algorithm used to link all episodes of hospital care for the same patient in England has a high missed match rate which underestimates the true readmission rate and will produce biased analyses. To reduce linkage error, pseudoanonymisation algorithms need to be validated against good quality reference standards. Pseudonymisation of data ‘at source’ does not itself address errors in patient identifiers and the impact these errors have on data linkage.

机译：目的我们的目的是通过在国家儿科重症监护记录中对照参考标准测试HESID伪匿名算法来估计医院情节统计（HES）中数据链接错误的发生率。设置儿科重症监护审核网络（PICANet）数据库，涵盖英格兰，苏格兰和威尔士的33个儿科重症监护病房。参与者来自2004年1月1日至2014年2月21日之间的0-19岁婴幼儿数据。主要和次要指标PICANet入院记录分为匹配项（属于已再次入院的同一患者的记录）或非匹配项。在将HESID算法应用于PICANet记录后进行匹配（属于不同患者的记录）。通过将HESID算法的结果与参考标准PICANet ID进行比较，可以计算出不匹配率和不匹配率。评估了连锁错误对再入院率的影响。结果166〜406例入院病例中，88〜596例为真匹配（同一患者已被重新入院）。 HESID假名化算法产生的错误匹配很少（n = 176/77？810; 0.2％），但是丢失匹配的比例更大（n = 3609/88？596; 4.1％）。由于链接错误，真实的重新录取率被低估了3.8％。来自亚洲/黑人/其他族裔（相对于白人）的年轻男性患者更容易出现假匹配。对于年轻的患者，亚洲/黑人/其他族裔群体（与白人）以及记录中缺少数据的患者，错过比赛更为常见。结论用于将英格兰同一位患者的所有医院护理事件联系起来的确定性算法具有很高的漏选率，这低估了真实的再入院率，并且会产生偏倚的分析。为了减少链接错误，伪匿名算法需要针对高质量的参考标准进行验证。 “从源头”获取数据的假名本身并不能解决患者标识符中的错误以及这些错误对数据链接的影响。

著录项

来源
《BMJ Open》 |2015年第8期|共页
作者
Gareth Hagger-Johnson; Harvey Goldstein; Katie Harron; Rebecca Landy; Roger C Parslow; Ruth Gilbert; Tom Fleming;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类临床医学;
关键词

相似文献

外文文献
中文文献
专利

1. Data linkage errors in hospital administrative data when applying a pseudonymisation algorithm to paediatric intensive care records [J] . Gareth Hagger-Johnson, Harvey Goldstein, Katie Harron, BMJ Open . 2015,第8期

机译：将假名化算法应用于儿科重症监护记录时，医院管理数据中的数据链接错误
2. Comparing routine administrative data with registry data for assessing quality of hospital care in patients with myocardial infarction using deterministic record linkage [J] . Birga Maier, Katrin Wagner, Steffen Behrens, BMC Health Services Research . 2016,第1期

机译：使用确定性记录链接将常规行政数据与注册表数据进行比较，以评估心肌梗死患者的医院护理质量
3. Combining deterministic and probabilistic matching to reduce data linkage errors in hospital administrative data [J] . Gareth Hagger-Johnson, Katie Harron, Rob Aldridge, International Journal of Population Data Science . 2017,第1期

机译：结合确定性和概率匹配以减少医院管理数据中的数据链接错误
4. Enhancing Nationwide Medico-Administrative Databases Analysis with SAF4SUHAD: A Statistical Analysis Framework for Secondary Use of Healthcare Administrative Databases [C] . Alexra GEORGES, Thibaut BALCAEN, Alexra CARON, EFMI STC 2018 . 2018

机译：使用SAF4Suhad增强全国医疗管理数据库分析：统计分析框架用于次要使用医疗管理管理数据库
5. Development of National and Sub-national Electronic Health Records to Enable Health Data Exchange for Improved Maternal Health Service Delivery and Program: The Case of a Tertiary Care State Government Hospital in India [D] . Kumar, Manish. 2021

机译：国家和亚国家电子卫生记录的发展，以实现卫生数据交换，以改善产妇卫生服务交付和计划：印度高等教育州政府医院的情况
6. Data linkage errors in hospital administrative data when applying a pseudonymisation algorithm to paediatric intensive care records [O] . Gareth Hagger-Johnson, Katie Harron, Tom Fleming, 2015

机译：将假名化算法应用于儿科重症监护记录时医院管理数据中的数据链接错误
7. Data linkage errors in hospital administrative data when applying a pseudonymisation algorithm to paediatric intensive care records [O] . Hagger-Johnson Gareth, Harron Katie, Fleming Tom, 2015

机译：将假名化算法应用于儿科重症监护记录时，医院管理数据中的数据链接错误
8. Use of Electronic Medical Records and Administrative Claims Data for Assessing Type 2 Diabetes Care. Effective Health Care Research Reports No. 18 [R] . West, S. L., Liu, Z., McKay, J. N., 2010

机译：使用电子病历和行政索赔数据评估2型糖尿病护理。有效的医疗保健研究报告第18号

Data linkage errors in hospital administrative data when applying a pseudonymisation algorithm to paediatric intensive care records

摘要

著录项

相似文献

相关主题

期刊订阅