ACM Transactions on Knowledge Discovery from Data

Data Sharing via Differentially Private Coupled Matrix Factorization

Abstract

We address the privacy-preserving data-sharing problem in a distributed multiparty setting. In this setting, each data site owns a distinct part of a dataset, and the aim is to estimate the parameters of a statistical model conditioned on the complete data without any site revealing any information about the individuals in its own part. The sites want to maximize the utility of the collective data analysis while providing privacy guarantees both for their own portion of the data and for each participating individual. Our first contribution is to classify these different privacy requirements as (i) site-level and (ii) user-level differential privacy and to present formal privacy guarantees for these two cases under the model of differential privacy. To satisfy a stronger form of differential privacy, we use local differential privacy, a variant in which the sensitive data are perturbed with a randomized response mechanism prior to estimation. In this study, we assume that the data instances partitioned among the parties are arranged as matrices. A natural statistical model for this distributed scenario is coupled matrix factorization. We present two generic frameworks for privatizing Bayesian inference in coupled matrix factorization models, which guarantee the proposed differential privacy notions according to the privacy requirements of the model. To privatize Bayesian inference, we first exploit the connection between differential privacy and sampling from a Bayesian posterior via stochastic gradient Langevin dynamics, and then derive an efficient coupled matrix factorization method. In the local privacy context, we propose two models with an additional privatization mechanism that achieves a stronger measure of privacy, and we introduce a Gibbs-sampling-based algorithm. We demonstrate that the proposed methods provide good prediction accuracy on synthetic and real datasets while adhering to the introduced privacy constraints.
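The abstract describes privatizing posterior sampling for coupled matrix factorization through stochastic gradient Langevin dynamics. As a rough illustration of that family of techniques, and not the paper's algorithm, the sketch below performs one noisy SGLD-style update for two data matrices X ≈ U V1ᵀ and Y ≈ U V2ᵀ that share a factor U, with per-user clipping and Gaussian noise added to the data-dependent gradients. All names and parameter choices (clip norm C, noise scale sigma_dp, step size eta, prior precision lam) are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)

def clip_rows(G, C):
    # Clip each row of G to L2 norm at most C, limiting any single user's influence.
    norms = np.linalg.norm(G, axis=1, keepdims=True)
    return G * np.minimum(1.0, C / np.maximum(norms, 1e-12))

def dp_sgld_step(U, V1, V2, X, Y, eta=1e-3, lam=1.0, C=1.0, sigma_dp=1.0):
    # One noisy SGLD-style update on the shared factor U and the view-specific
    # factors V1, V2.  Per-user (per-row) gradient contributions are clipped
    # before accumulation and Gaussian noise scaled by C * sigma_dp is added to
    # the data-dependent gradient -- the usual recipe for privatizing
    # gradient-based posterior sampling; the noise calibration here is illustrative.
    R1 = X - U @ V1.T          # residuals for the first view
    R2 = Y - U @ V2.T          # residuals for the second view

    # Each row of (R1 @ V1 + R2 @ V2) is one user's gradient for their row of U.
    G_U = clip_rows(R1 @ V1 + R2 @ V2, C) - lam * U

    # For V1, V2: clip each user's residual row before summing the contributions.
    G_V1 = clip_rows(R1, C).T @ U - lam * V1
    G_V2 = clip_rows(R2, C).T @ U - lam * V2

    def step(theta, grad):
        noise_dp = C * sigma_dp * rng.standard_normal(theta.shape)       # privacy noise
        noise_sgld = np.sqrt(2.0 * eta) * rng.standard_normal(theta.shape)  # Langevin noise
        return theta + eta * (grad + noise_dp) + noise_sgld

    return step(U, G_U), step(V1, G_V1), step(V2, G_V2)

# Tiny usage example on synthetic data.
n, d1, d2, k = 50, 8, 6, 3
U = rng.standard_normal((n, k))
V1 = rng.standard_normal((d1, k))
V2 = rng.standard_normal((d2, k))
X = U @ V1.T + 0.1 * rng.standard_normal((n, d1))
Y = U @ V2.T + 0.1 * rng.standard_normal((n, d2))
for _ in range(200):
    U, V1, V2 = dp_sgld_step(U, V1, V2, X, Y)

Calibrating sigma_dp to a concrete (ε, δ) budget, distinguishing the site-level and user-level guarantees, and handling the multiparty exchange of factors are precisely the parts the paper works out and are not attempted in this sketch.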
