首页> 外文OA文献 >Development and evaluation of a de-identification procedure for a case register sourced from mental health electronic records.
【2h】

Development and evaluation of a de-identification procedure for a case register sourced from mental health electronic records.

机译:开发和评估从心理健康电子记录中获取的病案登记的去识别程序。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Background: Electronic health records (EHRs) provide enormous potential for health research but also present data governance challenges. Ensuring de-identification is a pre-requisite for use of EHR data without prior consent. The South London and Maudsley NHS Trust (SLaM), one of the largest secondary mental healthcare providers in Europe, has developed, from its EHRs, a de-identified psychiatric case register, the Clinical Record Interactive Search (CRIS), for secondary research.ududMethods: We describe development, implementation and evaluation of a bespoke de-identification algorithm used to create the register. It is designed to create dictionaries using patient identifiers (PIs) entered into dedicated source fields and then identify, match and mask them (with ZZZZZ) when they appear in medical texts. We deemed this approach would be effective, given high coverage of PI in the dedicated fields and the effectiveness of the masking combined with elements of a security model. We conducted two separate performance tests i) to test performance of the algorithm in masking individual true PIs entered in dedicated fields and then found in text (using 500 patient notes) and ii) to compare the performance of the CRIS pattern matching algorithm with a machine learning algorithm, called the MITRE Identification Scrubber Toolkit – MIST (using 70 patient notes – 50 notes to train, 20 notes to test on). We also report any incidences of potential breaches, defined by occurrences of 3 or more true or apparent PIs in the same patient’s notes (and in an additional set of longitudinal notes for 50 patients); and we consider the possibility of inferring information despite de-identification.ududResults: True PIs were masked with 98.8% precision and 97.6% recall. As anticipated, potential PIs did appear, owing to misspellings entered within the EHRs. We found one potential breach. In a separate performance test, with a different set of notes, CRIS yielded 100% precision and 88.5% recall, while MIST yielded a 95.1% and 78.1%, respectively. We discuss how we overcome the realistic possibility – albeit of low probability – of potential breaches through implementation of the security model.ududConclusion: CRIS is a de-identified psychiatric database sourced from EHRs, which protects patient anonymity and maximises data available for research. CRIS demonstrates the advantage of combining an effective de-identification algorithm with a carefully designed security model. The paper advances much needed discussion of EHR de-identification – particularly in relation to criteria to assess de-identification, and considering the contexts of de-identified research databases when assessing the risk of breaches of confidential patient information.
机译:背景:电子健康记录(EHR)为健康研究提供了巨大的潜力,但同时也带来了数据治理方面的挑战。未经事先同意,使用EHR数据的前提是要确保取消身份识别。欧洲最大的二级精神保健服务提供商之一,南伦敦和Maudsley NHS信托(SLaM)已从其EHR中开发了一种身份不明的精神病病例登记表,即临床记录交互式搜索(CRIS),用于二级研究。 ud udMethods:我们描述了用于创建寄存器的定制去识别算法的开发,实现和评估。它旨在使用输入到专用源字段中的患者标识符(PI)创建词典,然后在它们出现在医学文本中时进行识别,匹配和屏蔽(使用ZZZZZ)。考虑到PI在专用领域中的覆盖率很高以及屏蔽与安全模型元素相结合的有效性,我们认为这种方法将是有效的。我们进行了两个单独的性能测试:i)测试该算法在掩盖在专用字段中输入的单个真实PI,然后在文本中找到的真实PI(使用500个患者注释)的性能,以及ii)将CRIS模式匹配算法与机器的性能进行比较学习算法,称为MITER识别洗涤器工具包-MIST(使用70个患者笔记-训练50个笔记,测试20个笔记)。我们还报告了任何潜在的违规事件,这些事件是由同一位患者的笔记中出现3个或更多个真实或明显的PI(以及另外50个患者的纵向笔记中)所定义的; ud ud结果:真实的PI被掩盖了98.8%的精度和97.6%的查全率。如预期的那样,由于EHR中输入的拼写错误,确实出现了潜在的PI。我们发现了一个潜在的漏洞。在单独的性能测试中,使用一组不同的注释,CRIS产生了100%的准确度和88.5%的查全率,而MIST分别产生了95.1%和78.1%。我们讨论了如何通过实施安全模型来克服潜在违规的现实可能性(尽管可能性很小)。 ud ud结论:CRIS是源自EHR的身份不明的精神病学数据库,可保护患者匿名并最大化可用于研究。 CRIS展示了将有效的去识别算法与精心设计的安全模型相结合的优势。本文推进了有关EHR取消身份识别的急切讨论,尤其是与评估取消身份识别的标准有关的讨论,并在评估违反机密患者信息的风险时考虑了未识别身份的研究数据库的情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号