首页> 外文期刊>Knowledge and information systems >A distributed approach to enabling privacy-preserving model-based classifier training
【24h】

A distributed approach to enabling privacy-preserving model-based classifier training

机译:一种实现基于隐私保护模型的分类器训练的分布式方法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes a novel approach for privacy-preserving distributed model-based classifier training. Our approach is an important step towards supporting customizable privacy modeling and protection. It consists of three major steps. First, each data site independently learns a weak concept model (i.e., local classifier) for a given data pattern or concept by using its own training samples. An adaptive EM algorithm is proposed to select the model structure and estimate the model parameters simultaneously. The second step deals with combined classifier training by integrating the weak concept models that are shared from multiple data sites. To reduce the data transmission costs and the potential privacy breaches, only the weak concept models are sent to the central site and synthetic samples are directly generated from these shared weak concept models at the central site. Both the shared weak concept models and the synthetic samples are then incorporated to learn a reliable and complete global concept model. A computational approach is developed to automatically achieve a good trade off between the privacy disclosure risk, the sharing benefit and the data utility. The third step deals with validating the combined classifier by distributing the global concept model to all these data sites in the collaboration network while at the same time limiting the potential privacy breaches. Our approach has been validated through extensive experiments carried out on four UCI machine learning data sets and two image data sets.
机译:本文提出了一种新的基于隐私保护的分布式基于模型的分类器训练方法。我们的方法是朝着支持可定制的隐私建模和保护迈出的重要一步。它包括三个主要步骤。首先,每个数据站点都使用自己的训练样本针对给定的数据模式或概念独立学习弱概念模型(即本地分类器)。提出了一种自适应EM算法来选择模型结构并同时估计模型参数。第二步通过集成从多个数据站点共享的弱概念模型来处理组合分类器训练。为了降低数据传输成本和潜在的隐私漏洞,仅将弱概念模型发送到中心站点,并从这些共享的弱概念模型在中心站点直接生成合成样本。然后,将共享的弱概念模型和综合样本都合并以​​学习可靠且完整的全局概念模型。开发了一种计算方法来自动实现隐私公开风险,共享利益和数据实用程序之间的良好折衷。第三步涉及通过将全局概念模型分布到协作网络中的所有这些数据站点来验证组合分类器,同时限制潜在的隐私泄露。通过对四个UCI机器学习数据集和两个图像数据集进行的广泛实验,我们的方法得到了验证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号