首页> 外文会议>IEEE International Conference on Healthcare Informatics, Imaging and Systems Biology >A Randomized Response Model for Privacy-Preserving Data Dissemination
【24h】

A Randomized Response Model for Privacy-Preserving Data Dissemination

机译:保留隐私数据传播的随机响应模型

获取原文
获取外文期刊封面目录资料

摘要

Public dissemination of medical data encourages meaningful research and quality improvement. However, there is a big concern that improper disclosure may put sensitive personal information at risk. To maintain the research benefits and customize the privacy protection, we propose a novel and practical randomized response model (k-shuffle) and a statistical information recovery procedure. The former mixes distribution of patient records with samples drawn from k-1 pre-determined distributions to ensure differential privacy. The latter allows data receivers to recover statistical properties (e.g., the mean and variance) of interested sub-populations with accuracy proportional to the size of the sub-population. That is, our algorithm provides stronger privacy protection to smaller groups, and offers high data usability to studies targeted at larger population. Most importantly, with differential privacy guarantee, data receiver cannot reconstruct the record-to-identity mapping for each individual. In summary, our approach offers a scalable privacy-preserving data dissemination mechanism that can be applied in both centralized and distributed fashion, which makes it possible for perturbed data to be outsourced (in the cloud) with mitigated privacy risks. Our experimental results demonstrated the performance of our model in terms of privacy protection, information loss, and classification accuracy using both synthetic and real-world datasets.
机译:公众传播医疗数据鼓励有意义的研究和质量改进。然而,有一个很好的关切,即不正当的披露可能将敏感的个人信息造成风险。为了维持研究效益并定制隐私保护,我们提出了一种新颖和实际的随机响应模型(K-Shuffle)和统计信息恢复程序。前者将患者记录分布与从K-1预定分布中汲取的样品进行分布,以确保差异隐私。后者允许数据接收器以与子群群体的大小成比例的准确性恢复感兴趣的子群的统计属性(例如,均值和方差)。也就是说,我们的算法向较小的群体提供了更强烈的隐私保护,并提供高数据可用性,以在更大的人口上进行研究。最重要的是,通过差异隐私保证,数据接收器无法重建每个单独的记录到标识映射。总之,我们的方法提供了可扩展的隐私保留数据传播机制,可以以集中式和分布式的方式应用,这使得扰动数据可以在具有减少的隐私风险的情况下外包(在云中)。我们的实验结果表明,在隐私保护,信息丢失和分类准确性使用综合性和现实世界数据集的情况表明了我们的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号