A Short Survey of LSTM Models for De-identification of Medical Free Text

机译：对LSTM模型的简短调查，用于证明医疗自由文本

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The confidentiality of patient information is legislated by governmental regulations in various countries, such as the Health Insurance Portability and Accountability Act (HIPAA) standards in the USA. Under these laws, adequate protections must be in place to safeguard patients' health records, which are often big data comprised of free text. Machine learning approaches are extensively used for the automated de-identification of medical free text, with outstanding results obtained from several studies that incorporate long short-term memory (LSTM) networks. These networks are a variant of the recurrent neural network (RNN) architecture. Our survey of LSTM models dates back five years, and the contribution of the findings is appreciable. Performance-wise, LSTMs generally surpassed other types of models used in automated de-identification of free text, namely conditional random field (CRF) algorithms and rule-based algorithms. In addition, hybrid or ensemble LSTM models did not outperform LSTM -only models. Finally, we note that the customization of gold-standard, de-identification datasets may result in overfitted models.

机译：患者信息的机密性受到各国政府法规的立法，例如美国的健康保险便携式和问责法（HIPAA）标准。根据这些法律，必须制定足够的保护以保护患者的健康记录，这通常是由自由文本组成的大数据。机器学习方法广泛用于医疗自由识别的自动解除识别，从若干研究中获得了优异的结果，这些研究包括长短短期内存（LSTM）网络。这些网络是经常性神经网络（RNN）架构的变体。我们对LSTM模型的调查历史追溯到五年，结果可观。性能明智，LSTM通常超过自动取消识别自由文本的其他类型的模型，即条件随机字段（CRF）算法和基于规则的算法。此外，混合或集合LSTM模型没有胜过LSTM-only模型。最后，我们注意到金标的定制，去识别数据集可能导致过度的模型。

著录项

来源
《IEEE International Conference on Collaboration and Internet Computing》|2020年|117-124|共8页
会议地点
作者
Joffrey L. Leevy; Taghi M. Khoshgoftaar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Training; Task analysis; Machine learning; MIMICs; Recurrent neural networks; Logic gates; Tuning;

机译：培训;任务分析;机器学习;模仿;经常性神经网络;逻辑门;调整;

相似文献

外文文献
中文文献
专利

1. De-identification of primary care electronic medical records free-text data in Ontario, Canada [J] . Karen Tu, Julie Klein-Geltink, Tezeta F Mitiku, BMC Medical Informatics and Decision Making . 2010,第1期

机译：在加拿大安大略省取消对初级保健电子医疗记录自由文本数据的标识
2. Automated de-identification of free-text medical records [J] . Ishna Neamatullah, Margaret M Douglass, Li-wei H Lehman, BMC Medical Informatics and Decision Making . 2008,第1期

机译：自动取消识别自由文本的病历
3. Improved de-identification of physician notes through integrative modeling of both public and private medical text [J] . Andrew J McMurry, Britt Fitch, Guergana Savova, BMC Medical Informatics and Decision Making . 2013,第1期

机译：通过对公共和私人医学文本进行集成建模，改进了对医生笔记的去身份识别
4. De-identification of free-text medical records in health information exchange [C] . ZHOU Tian-shu, LI Peng-fei, LI Jing-song International Workshop on Cloud Computing and Information Security . 2013

机译：在健康信息交换中的自由文本医疗记录去识别
5. Improving the failure-to-attend occurrences in an inner-city family medical practice: Utilizing short message system text messaging as a patient reminder system. [D] . Leonard, Takesha La'Shawn. 2015

机译：改善市区内家庭医疗实践中出现的故障率：将短消息系统文本消息作为患者提醒系统。
6. De-identification of Clinical Text via Bi-LSTM-CRF with Neural Language Models [O] . Buzhou Tang, Dehuan Jiang, Qingcai Chen, 2019

机译：通过带有神经语言模型的Bi-LSTM-CRF取消对临床文本的识别
7. DE-IDENTIFICATION OF PROTECTED HEALTH INFORMATION PHI FROM FREE TEXT IN MEDICAL RECORDS [O] . Geetha Mahadevaiah, M.S Dinesh, Rithesh Sreenivasan, 2019

机译：从医疗记录中的自由文本取消识别受保护的健康信息PHI

A Short Survey of LSTM Models for De-identification of Medical Free Text

摘要

著录项

相似文献

相关主题

期刊订阅