Ensemble-based Methods to Improve De-identification of Electronic Health Record Narratives

Abstract

Text de-identification is an application of clinical natural language processing that offers significant efficiency and scalability advantages. Hence, various learning algorithms have been applied to this task to yield better performance. Instead of choosing the best individual learning algorithm, we aim to improve de-identification by constructing ensembles that lead to more accurate classification. We present three different ensemble methods that combine multiple de-identification models trained from deep learning, shallow learning, and rule-based approaches. Each model is capable of automated de-identification without manual medical expertise. Our experimental results show that the stacked learning ensemble is more effective than other ensemble methods, producing the highest recall, the most important metric for de-identification. The stacked ensemble achieved state-of-the-art performance on the 2014 i2b2 dataset with 97.04% precision, 94.45% recall, and 95.73% F1 score.
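
The reported F1 score can be checked against the reported precision and recall, assuming the standard definition of F1 as the harmonic mean of the two. The helper name `f1_score` below is illustrative only and is not taken from the paper. A minimal sketch:

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (same units as the inputs)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * precision * recall / (precision + recall)


# Figures reported in the abstract for the stacked ensemble, in percent.
precision, recall = 97.04, 94.45
print(f"F1 = {f1_score(precision, recall):.2f}%")  # prints "F1 = 95.73%"
```

The result agrees with the 95.73% F1 score stated in the abstract.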

Bibliographic details

Journal: AMIA Annual Symposium Proceedings
Authors: Youngjun Kim; Paul Heider; Stéphane Meystre
Year (volume): 2018 (2018)
Pages: 663–672
Total pages: 10
Original format: PDF

Similar literature

1. Development of Automated Methods for Big Data to Achieve Compliance With IRB, Institutional, and Federal Requirements in the De-Identification of Narratives and Structured Data Focused on Safety Signals for Adverse Drug or Device Events From Electronic Medical Records [J]. West Dennis P., Temps William H., Tice Debra G. Journal of Empirical Research on Human Research Ethics. 2016, No. 1.
2. Methods for the de-identification of electronic health records for genomic research [J]. Khaled El Emam. Genome Medicine. 2011, No. 4.
3. Beyond Getting Rid of Stupid Stuff in the Electronic Health Record (Beyond-GROSS): Protocol for a User-Centered, Mixed-Method Intervention to Improve the Electronic Health Record System [J]. Ahmed Umar Otokiti, Catherine K Craven, Avniel Shetreat-Klein. JMIR Research Protocols. 2021, No. 3.
4. Generation of Surrogates for De-Identification of Electronic Health Records [C]. Aipeng Chen, Jitendra Jonnagaddala, Chini Nekkantti. MEDINFO. 2019.
5. Development of National and Sub-national Electronic Health Records to Enable Health Data Exchange for Improved Maternal Health Service Delivery and Program: The Case of a Tertiary Care State Government Hospital in India [D]. Kumar, Manish. 2021.
6. Methods for the de-identification of electronic health records for genomic research [O]. Khaled El Emam. 2011.
7. Methods for the de-identification of electronic health records for genomic research [O]. El Emam, Khaled. 2011.
8. Vital and Health Statistics, Series 2, Number 143. Assessing the Potential of National Strategies for Electronic Health Records for Population Health Monitoring and Research. Data Evaluation and Methods Research [R]. 2006.
