首页> 外文期刊>MIS quarterly >Digression and Value Concatenation to Enable Privacy-Preserving Regression
【24h】

Digression and Value Concatenation to Enable Privacy-Preserving Regression

机译:离题和值级联以实现隐私保护回归

获取原文
获取原文并翻译 | 示例
       

摘要

Regression techniques can be used not only for legitimate data analysis, but also to infer private information about individuals. In this paper, we demonstrate that regression trees, a popular data-analysis and data-mining technique, can be used to effectively reveal individuals' sensitive data. This problem, which we call a regression attack, has not been addressed in the data privacy literature, and existing privacy-preserving techniques are not appropriate in coping with this problem. We propose a new approach to counter regression attacks. To protect against privacy disclosure, our approach introduces a novel measure, called digression, which assesses the sensitive value disclosure risk in the process of building a regression tree model. Specifically, we develop an algorithm that uses the measure for pruning the tree to limit disclosure of sensitive data. We also propose a dynamic value-concatenation method for anonymizing data, which better preserves data utility than a user-defined generalization scheme commonly used in existing approaches. Our approach can be used for anonymizing both numeric and categorical data. An experimental study is conducted using real-world financial, economic, and healthcare data. The results of the experiments demonstrate that the proposed approach is very effective in protecting data privacy while preserving data quality for research and analysis.
机译:回归技术不仅可以用于合法数据分析,还可以用于推断有关个人的私人信息。在本文中,我们证明了回归树是一种流行的数据分析和数据挖掘技术,可以用来有效地揭示个人的敏感数据。数据隐私文献中尚未解决这个称为回归攻击的问题,并且现有的隐私保护技术不适用于解决此问题。我们提出了一种对抗回归攻击的新方法。为了防止隐私泄露,我们的方法引入了一种称为离题的新方法,该方法在建立回归树模型的过程中评估敏感值披露风险。具体来说,我们开发了一种使用该方法修剪树以限制敏感数据公开的算法。我们还提出了一种用于对数据进行匿名处理的动态值连接方法,该方法比现有方法中常用的用户定义泛化方案更好地保留了数据实用性。我们的方法可用于匿名化数字和分类数据。使用真实的金融,经济和医疗保健数据进行实验研究。实验结果表明,所提出的方法在保护数据隐私的同时保持数据质量以供研究和分析非常有效。

著录项

  • 来源
    《MIS quarterly》 |2014年第3期|679-698|共20页
  • 作者

    Xiao-Bai Li; Sumit Sarkar;

  • 作者单位

    Department of Operations and Information Systems, Manning School of Business, University of Massachusetts Lowell, Lowell, MA 01854 U.S.A.;

    Naveen Jindal School of Management, University of Texas at Dallas, Richardson, TX 75080 U.S.A.;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Privacy; data analytics; data mining; regression; regression trees; anonymization;

    机译:隐私;数据分析;数据挖掘;回归回归树;匿名化;
  • 入库时间 2022-08-17 13:16:45

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号