Using deep learning to preserve data confidentiality



Abstract

Preserving data confidentiality is crucial when releasing microdata for public use. A variety of approaches have been proposed, many of them based on traditional probability theory and statistics. These approaches mainly focus on masking the original data. In practice, such masking techniques cover only part of the data and risk leaving sensitive data exposed on release. In this paper, we approach the problem with a deep learning-based generative model that generates simulated data to mask the original data. Generating simulated data that holds the same statistical characteristics as the raw data is both the key idea and the main challenge of this study. In particular, we explore the statistical similarities between the raw data and the generated data, given that the two are not obviously distinguishable. We evaluate our results with two statistical metrics, Absolute Relative Residual Values and Hellinger Distance. We also conduct extensive experiments to validate our idea on two real-world datasets: the Census Dataset and the Environmental Dataset.
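The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. The abstract does not give formulas, so everything here is an assumption: the Hellinger distance uses its standard discrete form, H(P, Q) = sqrt(0.5 * sum((sqrt(p_i) - sqrt(q_i))^2)), and the "absolute relative residual" is assumed to be |raw - generated| / |raw| for a summary statistic, which may differ from the paper's definition. The function names (`hellinger_distance`, `shared_histograms`, `absolute_relative_residual`) are hypothetical.

```python
import math


def hellinger_distance(p, q):
    """Hellinger distance between two discrete distributions.

    p and q are non-negative counts or probabilities over the same bins.
    Returns 0.0 for identical distributions and 1.0 for disjoint ones.
    """
    if len(p) != len(q):
        raise ValueError("p and q must have the same number of bins")
    sum_p, sum_q = sum(p), sum(q)
    # Normalize each input so it sums to 1 before comparing.
    total = sum(
        (math.sqrt(a / sum_p) - math.sqrt(b / sum_q)) ** 2
        for a, b in zip(p, q)
    )
    return math.sqrt(0.5 * total)


def shared_histograms(raw, generated, bins=10):
    """Bin two samples on the same equal-width edges and return both counts."""
    lo = min(min(raw), min(generated))
    hi = max(max(raw), max(generated))
    width = (hi - lo) / bins or 1.0  # avoid zero width for constant data

    def counts(values):
        result = [0] * bins
        for v in values:
            # Clamp so the maximum value lands in the last bin.
            idx = min(int((v - lo) / width), bins - 1)
            result[idx] += 1
        return result

    return counts(raw), counts(generated)


def absolute_relative_residual(raw_stat, generated_stat):
    """ASSUMED definition: |raw - generated| / |raw| for one statistic."""
    return abs(raw_stat - generated_stat) / abs(raw_stat)
```

Under these assumptions, a column of raw values and a column of generated values would be binned with `shared_histograms` and the two count lists passed to `hellinger_distance`; values near 0 indicate that the generated column follows a similar distribution to the raw one.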


Bibliographic record

Source: Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies, 2020, Issue 2, 13 pages
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology
Keywords: Data confidentiality; Generative model; Statistical evaluation metric

Similar literature

1. Using deep learning to preserve data confidentiality [J]. Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies, 2020, Issue 2.
2. Privacy-Preserving Deep Learning for the Detection of Protected Health Information in Real-World Data: Comparative Evaluation [J]. Sven Festag, Cord Spreckelsen. JMIR formative research, 2020, Issue 5.
3. An Automatic COPD Diagnosis with Deep Learning on Topology-Preserving Multi Spectral Image of EEG Data [J]. Sugiarto Tommy, Hsu Chun-Lung, Sun Chi-Tien. Basic & clinical pharmacology & toxicology, 2019, Issue S1.
4. Applying Deep Learning to Preserve Data Confidentiality Keynote Address [C]. Xiaohi Cui. IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, 2018.
5. Disclosure control of confidential data by applying PAC learning theory [D]. He, Ling. 2005.
6. Privacy-Preserving Deep Learning for the Detection of Protected Health Information in Real-World Data: Comparative Evaluation [O]. Sven Festag, Cord Spreckelsen. 2020.
7. Anonymization as homeomorphic data space transformation for privacy-preserving deep learning [O]. Anastasiia Girka, Vagan Terziyan, Mariia Gavriushenko. 2021.
