Confidential Boosting with Random Linear Classifiers for Outsourced User-Generated Data

机译：使用随机线性分类器对外包的用户生成数据进行机密提升

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

User-generated data is crucial to predictive modeling in many applications. With a web/mobile/wearable interface, a data owner can continuously record data generated by distributed users and build various predictive models from the data to improve its operations, services, and revenue. Due to the large size and evolving nature of users data, a data owner may rely on public cloud service providers (Cloud) for storage and computation scalability. Exposing sensitive user-generated data and advanced analytic models to Cloud raises privacy concerns. We present a confidential learning framework, SecureBoost, for data owners that want to learn predictive models from aggregated user-generated data but offload the storage and computational burden to Cloud without having to worry about protecting the sensitive data. SecureBoost allows users to submit encrypted or randomly masked data to designated Cloud directly. Our framework utilizes random linear classifiers (RLCs) as the base classifiers in the boosting framework to dramatically simplify the design of the proposed confidential protocols, yet still preserve the model quality. A Cryptographic Service Provider (CSP) is used to assist the Cloud's processing, reducing the complexity of the protocol constructions. We present two constructions of SecureBoost: HE+GC and SecSh+GC, using combinations of homomorphic encryption, garbled circuits, and random masking to achieve both security and efficiency. For a boosted model, Cloud learns only the RLCs and the CSP learns only the weights of the RLCs. Finally, the data owner collects the two parts to get the complete model. We conduct extensive experiments to understand the quality of the RLC-based boosting and the cost distribution of the constructions. Our results show that SecureBoost can efficiently learn high-quality boosting models from protected user-generated data.

机译：用户生成的数据对于许多应用程序中的预测建模至关重要。借助Web /移动设备/可穿戴设备界面，数据所有者可以连续记录分布式用户生成的数据，并根据数据构建各种预测模型，以改善其运营，服务和收入。由于用户数据的大小和不断发展的性质，数据所有者可能依赖于公共云服务提供商（Cloud）来实现存储和计算的可伸缩性。将敏感的用户生成的数据和高级分析模型暴露给Cloud会引起隐私问题。我们为数据所有者提供了一个机密的学习框架SecureBoost，这些所有者希望从聚合的用户生成的数据中学习预测模型，但又将存储和计算负担转移给了Cloud，而不必担心保护敏感数据。 SecureBoost允许用户直接将加密或随机屏蔽的数据提交到指定的Cloud。我们的框架利用随机线性分类器（RLC）作为Boosting框架中的基本分类器，以显着简化提议的机密协议的设计，同时仍保留模型质量。加密服务提供商（CSP）用于协助云的处理，从而降低了协议构建的复杂性。我们介绍了SecureBoost的两种结构：HE + GC和SecSh + GC，使用同态加密，乱码电路和随机掩蔽的组合来实现安全性和效率。对于增强模型，Cloud仅学习RLC，而CSP仅学习RLC的权重。最后，数据所有者收集两个部分以获得完整的模型。我们进行了广泛的实验，以了解基于RLC的助推器的质量和建筑的成本分配。我们的结果表明，SecureBoost可以从受保护的用户生成的数据中有效学习高质量的增强模型。

著录项

来源
《European symposium on research in computer security》|2019年|41-65|共25页
会议地点
作者
Sagar Sharma; Keke Chen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Bagging, Boosting and the Random Subspace Method for Linear Classifiers [J] . Marina Skurichina, Robert P.W. Duin Pattern Analysis and Applications . 2002,第2期

机译：线性分类器的装袋，增强和随机子空间方法
2. A new Top-k query processing algorithm to guarantee confidentiality of data and user queries on outsourced databases [J] . Hyeong-Jin Kim, Jae-Woo Chang International journal of systems assurance engineering and management . 2019,第5期

机译：一种新的Top-k查询处理算法，可确保外包数据库中数据和用户查询的机密性
3. MONcrypt: a technique to ensure the confidentiality of outsourced data in cloud storage [J] . S.S. Manikandasaran, L. Arockiam, P.D. Sheba Kezia Malarchelvi International journal of information and computer security . 2019,第1期

机译：MONcrypt：一种确保云存储中外包数据的机密性的技术
4. Confidential Boosting with Random Linear Classifiers for Outsourced User-Generated Data [C] . Sagar Sharma, Keke Chen European symposium on research in computer security . 2019

机译：用随机线性分类器进行外包用户生成数据的机密提升
5. An efficient boosted classifier tree-based feature point tracking system for facial expression analysis. [D] . Livingston, Adam Redd. 2012

机译：一个高效的基于分类器树的增强分类器特征点跟踪系统，用于面部表情分析。
6. Classification of Potential Water Bodies Using Landsat 8 OLI and a Combination of Two Boosted Random Forest Classifiers [O] . Byoung Chul Ko, Hyeong Hun Kim, Jae Yeal Nam 2015

机译：使用Landsat 8 OLI和两个增强型随机森林分类器的组合对潜在水体进行分类
7. Confidentiality-Preserving Query Execution of Fragmented Outsourced Data [O] . Bkakria, Anis, Cuppens, Frédéric, CUPPENS-BOULAHIA, Nora, 2013

机译：碎片化外包数据的保密性查询执行

Confidential Boosting with Random Linear Classifiers for Outsourced User-Generated Data

摘要

著录项

相似文献

相关主题

期刊订阅