...
首页> 外文期刊>Computers & Security >A differentially private greedy decision forest classification algorithm with high utility
【24h】

A differentially private greedy decision forest classification algorithm with high utility

机译:具有高效用的差异私有贪婪决策林分类算法

获取原文
获取原文并翻译 | 示例
           

摘要

The rapid development of data analysis technologies and the easily accessible datasets have enabled the construction of a comprehensive analytics model, which can facilitate the decision makings involved in services. Meanwhile, the individual privacy preservation is of great necessity. Decision tree is a common method in medical prediction and diagnose, known for its simplicity of understanding and interpreting. However, the process of building a decision tree might cause individual privacy disclosure. Differential privacy provides a rigorous mathematical definition of privacy by controlling the risk of privacy leakage in a manageable range while maintaining the statistical characteristics. In this paper, we propose a Differentially Private Greedy Decision Forest with high utility (DPGDF) to build a privacy-preserving decision forest. In DPGDF, we design a novel budget allocation strategy that allows the nodes in greater depth get more privacy budgets in the decision tree construction process, which can, to some extent, mitigate the problem of excessive noises introduced to the leaf nodes. To aggregate multiple trees into a forest, we propose a selective aggregation method based on the prediction accuracy of the decision forest. In addition, we develop an iterative method to speed up the process of selective aggregation. Finally, we experimentally prove that the proposed DPGDF can achieve a better performance on two practical datasets compared with other algorithms.
机译:数据分析技术的快速发展和易于访问的数据集已启用综合分析模型的构建,这可以促进参与服务的决策。与此同时,个人隐私保存是巨大的必然性。决策树是医学预测和诊断的常见方法,以其简化的理解和解释而闻名。但是,构建决策树的过程可能会导致个人隐私披露。差异隐私通过控制可管理范围内的隐私泄漏的风险提供了一个严格的私隐定义,同时保持统计特征。在本文中,我们提出了一个具有高效用(DPGDF)的差别私人贪婪决策林,以建立一个隐私保留决策林。在DPGDF中,我们设计了一种新的预算分配策略,允许节点更深入地获得更多隐私预算,在决策树施工过程中可以在某种程度上减轻引入到叶节点的过度噪声的问题。将多个树木聚集到森林中,我们提出了一种基于决策林的预测准确性的选择聚合方法。此外,我们开发了一种迭代方法来加快选择性聚集过程。最后,我们通过实验证明,与其他算法相比,建议的DPGDF可以在两个实际数据集中实现更好的性能。

著录项

  • 来源
    《Computers & Security》 |2020年第9期|101930.1-101930.9|共9页
  • 作者单位

    School of Control and Computer Engineering North China Electric Power University Beijing China;

    School of Control and Computer Engineering North China Electric Power University Beijing China;

    School of Control and Computer Engineering North China Electric Power University Beijing China;

    Department of Mathematics and Computer Science Fayetteville State University Fayetteville NC USA;

    Department of Computer and Information Science Temple University Philadelphia PA USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Differential privacy; Decision tree; Greedy decision forest; Privacy-Preserving data mining;

    机译:差异隐私;决策树;贪婪的决策森林;保留隐私数据挖掘;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号