N-Sanitization: A semantic privacy-preserving framework for unstructured medical datasets

Iwendi Celestine; Moqurrab Syed Atif; Anjum Adeel; Khan Sangeen; Mohan Senthilkumar; Srivastava Gautam

首页> 外文期刊>Computer Communications >N-Sanitization: A semantic privacy-preserving framework for unstructured medical datasets

【24h】

N-Sanitization: A semantic privacy-preserving framework for unstructured medical datasets

机译：N-Sanitization：非结构化医疗数据集的语义隐私保留框架

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The introduction and rapid growth of the Internet of Medical Things (IoMT), a subset of the Internet of Things (IoT) in the medical and healthcare systems, has brought numerous changes and challenges to current medical and healthcare systems. Healthcare organizations share data about patients with research organizations for various medical discoveries. Releasing such information is a tedious task since it puts the privacy of patients at risk with the understanding that textual health documents about an individual contains specific sensitive terms that need to be sanitized before such document can be released. Recent approaches improved the utility of protected output by substituting sensitive terms with appropriate "generalizations'' that are retrieved from several medical and general-purpose knowledge bases (KBs). However, these approaches perform unnecessary sanitization by anonymizing the negated assertions, e.g., AIDS-negative. This paper proposes a semantic privacy framework that effectively sanitizes the sensitive and semantically related terms in healthcare documents. The proposed model effectively identifies the negated assertions (e.g., AIDS-negative) before the sanitization process in IoMT which further improves the utility of sanitized documents. Moreover, besides considering the sensitive medical findings, we also incorporated state-of-the-art metrics, i.e., Protected Health Information (PHI), as defined in the privacy rules such as Health Insurance Portability and Accountability Act (HIPAA), Informatics for Integrating Biology & the Bedside (i2b2), and Materialize Interactive Medical Image Control System (MIMICS). The proposed approach is evaluated on real clinical data provided by i2b2. On average the detection (for both PHI's and medical findings) accuracy is improved with Precision, Recall and F-measure score at 21%, 51%, and 54% respectively. The overall improved data utility of our proposed model is 8% as compared to C-sanitized and 25% when comparing it with a simple reduction approach. Experimental results show that our approach effectively manages the privacy and utility trade-off as compared to its counterparts.

机译：医疗和医疗保健系统中，医疗器互联网（IOT）的介绍和快速增长，对当前的医疗和医疗保健系统带来了许多变化和挑战。医疗组织共享有关各种医疗发现的研究组织患者的数据。释放此类信息是一项繁琐的任务，因为它将患者的隐私性带来了风险，并且了解个人关于个人的文本健康文件包含需要在此类文件释放之前需要消毒的特定敏感术语。最近的方法通过从几种医疗和通用知识库（KBS）中检索的适当的“概括”来改善受保护的输出的效用。但是，这些方法通过匿名否定断言，例如艾滋病来表现不必要的消毒 - 本文提出了一个语义隐私框架，有效地消毒了医疗文件中的敏感和语义相关术语。所提出的模型在IOMT中的消毒过程之前有效地识别否定的断言（例如，艾滋病 - 负），这进一步改善了效用消毒文件。此外，除了考虑敏感的医学结果，我们还纳入了最先进的指标，即受保护的健康信息（PHI），如保健保险便携性和问责法（HIPAA）所定义，整合生物学和床头（I2B2）的信息学，并实现互动M edice图像控制系统（模拟）。所提出的方法是在I2B2提供的真实临床数据上进行评估。平均检测（PHI和医学发现）的精确度分别提高了精度，召回和5％，51％和54％。与简单的减少方法比较时，我们拟议模型的整体改进数据效用为8％和25％。实验结果表明，与同行相比，我们的方法有效地管理隐私和公用事业权衡。

著录项

来源
《Computer Communications》 |2020年第9期|160-171|共12页
作者
Iwendi Celestine; Moqurrab Syed Atif; Anjum Adeel; Khan Sangeen; Mohan Senthilkumar; Srivastava Gautam;
展开▼
作者单位

Bcc Cent South Univ Forestry & Technol Changsha 410004 Peoples R China|Coal City Univ Enugu Dept Math & Comp Sci Enugu 400231 Nigeria;

Air Univ Islamabad Islamabad 44000 Pakistan|Comsats Inst Informat Technol Islamabad 45550 Pakistan;

Comsats Inst Informat Technol Islamabad 45550 Pakistan;

Comsats Inst Informat Technol Islamabad 45550 Pakistan;

Vellore Inst Technol Sch Informat Technol & Engn Vellore 632014 Tamil Nadu India;

Brandon Univ Dept Math & Comp Sci Brandon MB R7A 6A9 Canada|China Med Univ Res Ctr Interneural Comp Taichung 40402 Taiwan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Anonymization; Document sanitization; Textual-privacy; Negated assertion; Medical data; IoMT;

机译：匿名化;文件消毒;文本隐私;否定断言;医疗数据;IOMT;

相似文献

外文文献
中文文献
专利

1. Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes [J] . Spiro P. Pantazatos, Jianrong Li, Paul Pavlidis, Cancer Informatics . 2009,第7期

机译：通过成像和非结构化表型的模型理论语义分解整合神经影像和微阵列数据集。
2. P4Mobi: A Probabilistic Privacy-Preserving Framework for Publishing Mobility Datasets [J] . Yang Qing, Shen Yiran, Vatsalan Dinusha, IEEE Transactions on Vehicular Technology . 2020,第7期

机译：p4mobi：用于发布移动数据集的概率隐私保留框架
3. Evaluating privacy-preserving record linkage using cryptographic long-term keys and multibit trees on large medical datasets [J] . Adrian P. Brown, Christian Borgs, Sean M. Randall, BMC Medical Informatics and Decision Making . 2017,第1期

机译：在大型医疗数据集上使用加密的长期密钥和多位树评估隐私保护记录链接
4. Privacy-Preserving Multiple Linear Regression of Vertically Partitioned Real Medical Datasets [C] . Hiroaki Kikuchi, Chika Hamanaga, Hideo Yasunaga, IEEE International Conference on Advanced Information Networking and Applications . 2017

机译：垂直划分的实际医学数据集的隐私保护多元线性回归
5. Using Semantic Web Tools to Create An Integrated Framework For Biomedical Research [D] . Holford, Matthew Edwin. 2014

机译：使用语义Web工具创建生物医学研究的集成框架
6. Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes [O] . Spiro P. Pantazatos, Jianrong Li, Paul Pavlidis, 2009

机译：通过非结构化表型的映射和模型理论语义分解整合神经影像和微阵列数据集。
7. Evaluating privacy-preserving record linkage using cryptographic long-term keys and multibit trees on large medical datasets [O] . Adrian P. Brown, Christian Borgs, Sean M. Randall, 2017

机译：在大型医疗数据集上使用加密长期键和多点树进行评估隐私记录联系

N-Sanitization: A semantic privacy-preserving framework for unstructured medical datasets

摘要

著录项

相似文献

相关主题

期刊订阅