首页> 外文期刊>Computer Communications >N-Sanitization: A semantic privacy-preserving framework for unstructured medical datasets
【24h】

N-Sanitization: A semantic privacy-preserving framework for unstructured medical datasets

机译:N-Sanitization:非结构化医疗数据集的语义隐私保留框架

获取原文
获取原文并翻译 | 示例
           

摘要

The introduction and rapid growth of the Internet of Medical Things (IoMT), a subset of the Internet of Things (IoT) in the medical and healthcare systems, has brought numerous changes and challenges to current medical and healthcare systems. Healthcare organizations share data about patients with research organizations for various medical discoveries. Releasing such information is a tedious task since it puts the privacy of patients at risk with the understanding that textual health documents about an individual contains specific sensitive terms that need to be sanitized before such document can be released. Recent approaches improved the utility of protected output by substituting sensitive terms with appropriate "generalizations'' that are retrieved from several medical and general-purpose knowledge bases (KBs). However, these approaches perform unnecessary sanitization by anonymizing the negated assertions, e.g., AIDS-negative. This paper proposes a semantic privacy framework that effectively sanitizes the sensitive and semantically related terms in healthcare documents. The proposed model effectively identifies the negated assertions (e.g., AIDS-negative) before the sanitization process in IoMT which further improves the utility of sanitized documents. Moreover, besides considering the sensitive medical findings, we also incorporated state-of-the-art metrics, i.e., Protected Health Information (PHI), as defined in the privacy rules such as Health Insurance Portability and Accountability Act (HIPAA), Informatics for Integrating Biology & the Bedside (i2b2), and Materialize Interactive Medical Image Control System (MIMICS). The proposed approach is evaluated on real clinical data provided by i2b2. On average the detection (for both PHI's and medical findings) accuracy is improved with Precision, Recall and F-measure score at 21%, 51%, and 54% respectively. The overall improved data utility of our proposed model is 8% as compared to C-sanitized and 25% when comparing it with a simple reduction approach. Experimental results show that our approach effectively manages the privacy and utility trade-off as compared to its counterparts.
机译:医疗和医疗保健系统中,医疗器互联网(IOT)的介绍和快速增长,对当前的医疗和医疗保健系统带来了许多变化和挑战。医疗组织共享有关各种医疗发现的研究组织患者的数据。释放此类信息是一项繁琐的任务,因为它将患者的隐私性带来了风险,并且了解个人关于个人的文本健康文件包含需要在此类文件释放之前需要消毒的特定敏感术语。最近的方法通过从几种医疗和通用知识库(KBS)中检索的适当的“概括”来改善受保护的输出的效用。但是,这些方法通过匿名否定断言,例如艾滋病来表现不必要的消毒 - 本文提出了一个语义隐私框架,有效地消毒了医疗文件中的敏感和语义相关术语。所提出的模型在IOMT中的消毒过程之前有效地识别否定的断言(例如,艾滋病 - 负),这进一步改善了效用消毒文件。此外,除了考虑敏感的医学结果,我们还纳入了最先进的指标,即受保护的健康信息(PHI),如保健保险便携性和问责法(HIPAA)所定义,整合生物学和床头(I2B2)的信息学,并实现互动M edice图像控制系统(模拟)。所提出的方法是在I2B2提供的真实临床数据上进行评估。平均检测(PHI和医学发现)的精确度分别提高了精度,召回和5%,51%和54%。与简单的减少方法比较时,我们拟议模型的整体改进数据效用为8%和25%。实验结果表明,与同行相比,我们的方法有效地管理隐私和公用事业权衡。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号