首页> 外文学位 >A Utility-Aware Privacy Preserving Framework For Distributed Data Mining With Worst Case Privacy Guarantee.
【24h】

A Utility-Aware Privacy Preserving Framework For Distributed Data Mining With Worst Case Privacy Guarantee.

机译:一个实用程序感知的隐私保护框架,用于具有最坏情况隐私保证的分布式数据挖掘。

获取原文
获取原文并翻译 | 示例

摘要

Data Mining is the task of finding meaningful patterns from huge amount of data. With the enormous growth of data and their distributed nature, storage of data and analysis of data are often separated, therefore developing the need for the research area of privacy preserving data mining. In other words, privacy preserving data mining is needed when the data is private in nature and revealing of sensitive information needs to the prevented, while still allowing mining of the data with reasonable accuracy. Various data perturbation techniques exist in literature for this purpose. One significant drawback with the existing methods is that they handle average case privacy scenario. But, while dealing with private and business data it would definitely be beneficial to have a privacy framework that would provide a certain guarantee that the data would not be divulged in the worst case.;In a distributed data setting, it is often beneficial for organizations to collaboratively perform data mining tasks without giving up their own data. This necessity has developed the research areas of secure multiparty computation and privacy preserving distributed data mining. There exist several protocols that deal with data mining tasks in a distributed scenario but most of these techniques handle a single data mining method. Therefore, if the participating parties are interested in more than one classification methods they will have to go through a series of distributed protocols every time for a different method thus increasing the overhead substantially.;Another critical problem with the existing privacy protection techniques is that they do not take the data mining tasks that will be performed on the perturbed data into consideration thus reducing the utility of the perturbation techniques substantially. In a distributed setting the parties are aware of the data mining tasks they would need to perform collaboratively. For example, the collaborative parties are aware that they are building a classification model or predicting an attribute. Therefore, if the data perturbation methods can be pruned according to the need of the end user the utility of the privacy protection techniques can be increased significantly.;Here, in this dissertation multiple privacy preserving data mining algorithms have been proposed for multiple data mining methods that address all these above mentioned issues and provide a utility aware approach to privacy preserving data mining in a centralized as well as distributed scenario with worst case privacy guarantee. These algorithms will also allow the end user to perform exploratory data analysis on the perturbed data. Detailed experimental results will demonstrate the effectiveness of these techniques.
机译:数据挖掘是从大量数据中寻找有意义的模式的任务。随着数据的巨大增长及其分布性质,数据的存储和数据的分析常常是分开的,因此需要对隐私保护数据挖掘的研究领域进行开发。换句话说,当数据本质上是私有的并且需要防止敏感信息的泄露时,就需要保护隐私的数据挖掘,同时仍然允许以合理的准确性挖掘数据。为此目的,文献中存在各种数据扰动技术。现有方法的一个显着缺点是它们处理普通情况下的隐私方案。但是,在处理私人和企业数据时,拥有一个隐私框架绝对可以带来好处,该框架可以确保在最坏的情况下不会泄露数据;在分布式数据设置中,这通常对组织有利协作执行数据挖掘任务而不会放弃自己的数据。因此有必要发展安全多方计算和隐私保护分布式数据挖掘的研究领域。在分布式场景中,有几种协议可以处理数据挖掘任务,但是其中大多数技术只能处理一种数据挖掘方法。因此,如果参与方对一种以上的分类方法感兴趣,则他们每次必须为不同的方法进行一系列分布式协议,从而大大增加了开销。现有的隐私保护技术的另一个关键问题是它们不要考虑将要在被扰动的数据上执行的数据挖掘任务,从而大大减少了扰动技术的效用。在分布式环境中,各方都知道他们需要协同执行的数据挖掘任务。例如,协作方知道他们正在建立分类模型或预测属性。因此,如果可以根据最终用户的需求来修剪数据扰动方法,则可以显着提高隐私保护技术的实用性。;在此,本文针对多种数据挖掘方法提出了多种隐私保护数据挖掘算法。该解决方案解决了上述所有这些问题,并提供了一种实用程序感知的方法,可在最坏情况下确保隐私的集中式和分布式方案中保护隐私数据挖掘。这些算法还将允许最终用户对受扰数据执行探索性数据分析。详细的实验结果将证明这些技术的有效性。

著录项

  • 作者

    Banerjee, Madhushri.;

  • 作者单位

    University of Maryland, Baltimore County.;

  • 授予单位 University of Maryland, Baltimore County.;
  • 学科 Information Technology.;Information Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 132 p.
  • 总页数 132
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号