首页> 外文会议>European symposium on research in computer security >Confidential Boosting with Random Linear Classifiers for Outsourced User-Generated Data
【24h】

Confidential Boosting with Random Linear Classifiers for Outsourced User-Generated Data

机译:使用随机线性分类器对外包的用户生成数据进行机密提升

获取原文

摘要

User-generated data is crucial to predictive modeling in many applications. With a web/mobile/wearable interface, a data owner can continuously record data generated by distributed users and build various predictive models from the data to improve its operations, services, and revenue. Due to the large size and evolving nature of users data, a data owner may rely on public cloud service providers (Cloud) for storage and computation scalability. Exposing sensitive user-generated data and advanced analytic models to Cloud raises privacy concerns. We present a confidential learning framework, SecureBoost, for data owners that want to learn predictive models from aggregated user-generated data but offload the storage and computational burden to Cloud without having to worry about protecting the sensitive data. SecureBoost allows users to submit encrypted or randomly masked data to designated Cloud directly. Our framework utilizes random linear classifiers (RLCs) as the base classifiers in the boosting framework to dramatically simplify the design of the proposed confidential protocols, yet still preserve the model quality. A Cryptographic Service Provider (CSP) is used to assist the Cloud's processing, reducing the complexity of the protocol constructions. We present two constructions of SecureBoost: HE+GC and SecSh+GC, using combinations of homomorphic encryption, garbled circuits, and random masking to achieve both security and efficiency. For a boosted model, Cloud learns only the RLCs and the CSP learns only the weights of the RLCs. Finally, the data owner collects the two parts to get the complete model. We conduct extensive experiments to understand the quality of the RLC-based boosting and the cost distribution of the constructions. Our results show that SecureBoost can efficiently learn high-quality boosting models from protected user-generated data.
机译:用户生成的数据对于许多应用程序中的预测建模至关重要。借助Web /移动设备/可穿戴设备界面,数据所有者可以连续记录分布式用户生成的数据,并根据数据构建各种预测模型,以改善其运营,服务和收入。由于用户数据的大小和不断发展的性质,数据所有者可能依赖于公共云服务提供商(Cloud)来实现存储和计算的可伸缩性。将敏感的用户生成的数据和高级分析模型暴露给Cloud会引起隐私问题。我们为数据所有者提供了一个机密的学习框架SecureBoost,这些所有者希望从聚合的用户生成的数据中学习预测模型,但又将存储和计算负担转移给了Cloud,而不必担心保护敏感数据。 SecureBoost允许用户直接将加密或随机屏蔽的数据提交到指定的Cloud。我们的框架利用随机线性分类器(RLC)作为Boosting框架中的基本分类器,以显着简化提议的机密协议的设计,同时仍保留模型质量。加密服务提供商(CSP)用于协助云的处理,从而降低了协议构建的复杂性。我们介绍了SecureBoost的两种结构:HE + GC和SecSh + GC,使用同态加密,乱码电路和随机掩蔽的组合来实现安全性和效率。对于增强模型,Cloud仅学习RLC,而CSP仅学习RLC的权重。最后,数据所有者收集两个部分以获得完整的模型。我们进行了广泛的实验,以了解基于RLC的助推器的质量和建筑的成本分配。我们的结果表明,SecureBoost可以从受保护的用户生成的数据中有效学习高质量的增强模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号