首页> 外文会议>IEEE International Conference on Big Data Science and Engineering >Privacy-Preserving Query Log Sharing Based on Prior N-Word Aggregation
【24h】

Privacy-Preserving Query Log Sharing Based on Prior N-Word Aggregation

机译:基于先前N字聚合的隐私保留查询日志共享

获取原文

摘要

Privacy-preserving query log sharing has attracted considerable attention especially after the incident of AOL privacy leakage. Queries and URLs of the query logs reflect user preferences, which can help to increase the quality of personalized services. However, these logs may disclose users' sensitive information, and thus need to be sanitized before publication. The existing solutions focused on how to sample records to satisfy differential privacy guarantee and to hide individual preferences in the sampled query logs by perturbation. However, all of them suffer from leakage in queries and URL access, as well as extra redundancy. In this work, we propose to greedily select samples with high utility and provide privacy guarantee by prior estimation of n-word phrase utility. In the estimation, we utilize novel metrics to conduct differential semantic aggregation and to select the representative in each cluster, which can help to achieve the objective of leaking less privacy and releasing more useful information. Extensive experiments on real-world datasets demonstrate the utility of our solutions without compromising individual privacy, and released query logs have been applied to personalized search.
机译:保护保留查询日志共享在AOL隐私泄漏事件发生后,特别引起了相当大的关注。查询日志的查询和URL反映了用户首选项,可以帮助提高个性化服务的质量。然而,这些日志可以揭示用户的敏感信息,因此需要在发布之前消毒。现有的解决方案专注于如何采样记录以满足差异隐私保证,并通过扰动隐藏采样查询日志中的单个偏好。但是,所有这些都遭受查询和URL访问的泄漏,以及额外的冗余。在这项工作中,我们建议贪婪地选择具有高效的样本,并通过先前估计N字短语效用提供隐私保证。在估计中,我们利用新颖的指标进行差异语义聚合,并在每个群集中选择代表,这有助于实现泄漏较少隐私并释放更有用的信息的目标。对现实世界数据集的广泛实验展示了我们解决方案的效用而不会影响个人隐私,并发布查询日志已应用于个性化搜索。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号