Logistic regression model training based on the approximate homomorphic encryption

Andrey Kim; Yongsoo Song; Miran Kim; Keewoo Lee; Jung Hee Cheon

首页> 外文期刊>BMC Medical Genomics >Logistic regression model training based on the approximate homomorphic encryption

【24h】

Logistic regression model training based on the approximate homomorphic encryption

机译：基于近似同态加密的逻辑回归模型训练

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Security concerns have been raised since big data became a prominent tool in data analysis. For instance, many machine learning algorithms aim to generate prediction models using training data which contain sensitive information about individuals. Cryptography community is considering secure computation as a solution for privacy protection. In particular, practical requirements have triggered research on the efficiency of cryptographic primitives. This paper presents a method to train a logistic regression model without information leakage. We apply the homomorphic encryption scheme of Cheon et al. (ASIACRYPT 2017) for an efficient arithmetic over real numbers, and devise a new encoding method to reduce storage of encrypted database. In addition, we adapt Nesterov’s accelerated gradient method to reduce the number of iterations as well as the computational cost while maintaining the quality of an output classifier. Our method shows a state-of-the-art performance of homomorphic encryption system in a real-world application. The submission based on this work was selected as the best solution of Track 3 at iDASH privacy and security competition 2017. For example, it took about six minutes to obtain a logistic regression model given the dataset consisting of 1579 samples, each of which has 18 features with a binary outcome variable. We present a practical solution for outsourcing analysis tools such as logistic regression analysis while preserving the data confidentiality.

机译：自从大数据成为数据分析中的重要工具以来，就引发了安全方面的担忧。例如，许多机器学习算法旨在使用包含有关个人的敏感信息的训练数据来生成预测模型。密码学界正在考虑将安全计算作为隐私保护的解决方案。特别是，实际需求已触发了对密码原语效率的研究。本文提出了一种在不泄漏信息的情况下训练逻辑回归模型的方法。我们应用Cheon等人的同态加密方案。（ASIACRYPT 2017）进行有效的实数运算，并设计了一种新的编码方法以减少加密数据库的存储。此外，我们采用Nesterov的加速梯度方法，以减少迭代次数和计算成本，同时保持输出分类器的质量。我们的方法显示了在实际应用中同态加密系统的最新性能。基于此项工作的提交被选为2017年iDASH隐私和安全竞赛第3道的最佳解决方案。例如，假设数据集包含1579个样本，每个样本有18个样本，则花费大约六分钟的时间来获得逻辑回归模型。具有二进制结果变量的特征。我们为外包分析工具（例如逻辑回归分析）提供了一种实用的解决方案，同时又保持了数据的机密性。

著录项

来源
《BMC Medical Genomics 》 |2018年第4期| 共页
作者
Andrey Kim; Yongsoo Song; Miran Kim; Keewoo Lee; Jung Hee Cheon;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类遗传学 ;
关键词
Homomorphic encryptionMachine learningLogistic regression;

机译：同态加密机器学习逻辑回归;

相似文献

外文文献
中文文献
专利

1. Privacy-preserving semi-parallel logistic regression training with fully homomorphic encryption [J] . Sergiu Carpov, Nicolas Gama, Mariya Georgieva, BMC Medical Genomics . 2020 ,第7期

机译：完全同态加密的隐私保留半行逻辑回归训练
2. Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation [J] . Miran Kim, Yongsoo Song, Shuang Wang, JMIR Medical Informatics . 2018 ,第2期

机译：基于同态加密的安全逻辑回归：设计与评估
3. Privacy-preserving Online Logistic Regression Based on Homomorphic Encryption [J] . Shuang WU, Junpei KAWAMOTO, Hiroaki KIKUCHI, 電子情報通信学会技術研究報告. 情報論的学習理論と機械学習 . 2013 ,第139期

机译：基于同态加密的隐私保护在线逻辑回归
4. Homomorphic Training of 30,000 Logistic Regression Models [C] . Flavio Bergamaschi, Shai Halevi, Tzipora T. Halevi, International conference on applied cryptography and network security . 2019

机译：30,000种Logistic回归模型的同态训练
5. Prediction of Foreign Object Debris/Damage type based in human factors for aeronautics using logistic regression model. [D] . Romo, David Ricardo. 2013

机译：使用逻辑回归模型，基于人为因素对航空业的异物碎片/损伤类型进行预测。
6. Logistic regression model training based on the approximate homomorphic encryption [O] . Andrey Kim, Yongsoo Song, Miran Kim, 2018

机译：基于近似同态加密的逻辑回归模型训练
7. Logistic regression model training based on the approximate homomorphic encryption [O] . Andrey Kim, Yongsoo Song, Miran Kim, 2018

机译：基于近似均匀加密的逻辑回归模型培训

Logistic regression model training based on the approximate homomorphic encryption

摘要

著录项

相似文献

相关主题

期刊订阅