【24h】

Anonymity-preserving data collection

机译:保持匿名的数据收集

获取原文

摘要

Protection of privacy has become an important problem in data mining. In particular, individuals have become increasingly unwilling to share their data, frequently resulting in individuals either refusing to share their data or providing incorrect data. In turn, such problems in data collection can affect the success of data mining, which relies on sufficient amounts of accurate data in order to produce meaningful results. Random perturbation and randomized response techniques can provide some level of privacy in data collection, but they have an associated cost in accuracy. Cryptographic privacy-preserving data mining methods provide good privacy and accuracy properties. However, in order to be efficient, those solutions must be tailored to specific mining tasks, thereby losing generality.In this paper, we propose efficient cryptographic techniques for online data collection in which data from a large number of respondents is collected anonymously, without the help of a trusted third party. Thatis, our solution allows the miner to collect the original data from each respondent, but in such a way that the miner cannot link a respondent's data to the respondent. An advantage of such a solution is that, because it does not change the actual data, its success does not depend on the underlying data mining problem. We provide proofs of the correctness and privacy of our solution, as well as experimental data that demonstrates its efficiency. We also extend our solution to tolerate certain kinds of malicious behavior of the participants.
机译:隐私保护已成为数据挖掘中的重要问题。特别是,个人变得越来越不愿意共享其数据,从而经常导致个人拒绝共享其数据或提供不正确的数据。反过来,数据收集中的此类问题会影响数据挖掘的成功,数据挖掘依赖于足够数量的准确数据才能产生有意义的结果。随机扰动和随机响应技术可以在数据收集中提供一定程度的私密性,但它们在准确性方面会带来相关的成本。密码保护隐私数据挖掘方法提供了良好的隐私和准确性属性。但是,为了提高效率,必须针对特定的挖掘任务量身定制这些解决方案,从而失去通用性。在本文中,我们提出了一种有效的加密技术,用于在线数据收集,其中匿名收集了来自大量受访者的数据,而没有受信任的第三方的帮助。也就是说,我们的解决方案允许矿工从每个响应者收集原始数据,但是这种方式使得矿工无法将响应者的数据链接到响应者。这种解决方案的优点在于,由于它不会更改实际数据,因此其成功与否取决于底层数据挖掘问题。我们提供解决方案正确性和隐私性的证明,以及证明其效率的实验数据。我们还扩展了解决方案,以容忍参与者的某些恶意行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号