首页> 外文会议>IEEE International Conference on Computer Communications >Learning Privately: Privacy-Preserving Canonical Correlation Analysis for Cross-Media Retrieval
【24h】

Learning Privately: Privacy-Preserving Canonical Correlation Analysis for Cross-Media Retrieval

机译:私下学习:跨媒体检索的隐私保留规范相关分析

获取原文

摘要

A massive explosion of various types of data has been triggered in the "Big Data" era. In big data systems, machine learning plays an important role due to its effectiveness in discovering hidden information and valuable knowledge. Data privacy, however, becomes an unavoidable concern since big data usually involve multiple organizations, e.g., different healthcare systems and hospitals, who are not in the same trust domain and may be reluctant to share their data publicly. Applying traditional cryptographic tools is a straightforward approach to protect sensitive information, but it often renders learning algorithms useless inevitably. In this work, we, for the first time, propose a novel privacy-preserving scheme for canonical correlation analysis (CCA), which is a well-known learning technique and has been widely used in cross-media retrieval system. We first develop a library of building blocks to support various arithmetics over encrypted real numbers by leveraging additively homomorphic encryption and garbled circuits. Then we encrypt private data by randomly splitting the numerical data, formalize CCA problem and reduce it to a symmetric eigenvalue problem by designing new protocols for privacy-preserving QR decomposition. Finally, we solve all the eigenvalues and the corresponding eigenvectors by running Newton-Raphson method and inverse power method over the ciphertext domain. We carefully analyze the security and extensively evaluate the effectiveness of our design. The results show that our scheme is practically secure, incurs negligible errors compared with performing CCA in the clear and performs comparably in cross-media retrieval systems.
机译:在“大数据”时代,各种数据的大规模爆炸已经触发。在大数据系统中,由于发现隐藏信息和宝贵知识的有效性,机器学习起着重要作用。然而,数据隐私成为一种不可避免的问题,因为大数据通常涉及多个组织,例如不同的医疗系统和医院,他们不在同一信任领域,可能不愿公开分享他们的数据。应用传统加密工具是一种保护敏感信息的直接方法,但它通常不可避免地呈现学习算法。在这项工作中,我们首次提出了一种用于规范相关分析(CCA)的新型隐私保留方案,这是一种知名的学习技术,并且已广泛用于跨媒检索系统。我们首先通过利用含有含有瘾的同性恋加密和乱码的电路来开发一个构建块的库,以支持通过加密的实数通过加密的实数。然后,我们通过随机分割数值数据来加密私有数据,通过设计新的保留QR分解的新协议来将CCA问题正式化CCA问题并将其降低到对称的特征值问题。最后,我们通过在密文域上运行牛顿-Raphson方法和逆电源方法来解决所有特征值和相应的特征向量。我们仔细分析了安全性,并广泛评估了我们设计的有效性。结果表明,我们的方案实际上是安全的,与在透明的CLCA中执行CCA的误差会忽略不计,并且在跨媒体检索系统中相对执行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号