首页> 外文期刊>Information Sciences: An International Journal >Utility preserving query log anonymization via semantic microaggregation
【24h】

Utility preserving query log anonymization via semantic microaggregation

机译:实用程序通过语义微聚合保留查询日志匿名化

获取原文
获取原文并翻译 | 示例
           

摘要

Query logs are of great interest for scientists and companies for research, statistical and commercial purposes. However, the availability of query logs for secondary uses raises privacy issues since they allow the identification and/or revelation of sensitive information about individual users. Hence, query anonymization is crucial to avoid identity disclosure. To enable the publication of privacy-preserved - but still useful - query logs, in this paper, we present an anonymization method based on semantic microaggregation. Our proposal aims at minimizing the disclosure risk of anonymized query logs while retaining their semantics as much as possible. First, a method to map queries to their formal semantics extracted from the structured categories of the Open Directory Project is presented. Then, a microaggregation method is adapted to perform a semantically-grounded anonymization of query logs. To do so, appropriate semantic similarity and semantic aggregation functions are proposed. Experiments performed using real AOL query logs show that our proposal better retains the utility of anonymized query logs than other related works, while also minimizing the disclosure risk.
机译:查询日志对于科学家和公司进行研究,统计和商业用途非常感兴趣。但是,可用于二次用途的查询日志会引起隐私问题,因为它们允许识别和/或显示有关各个用户的敏感信息。因此,查询匿名化对于避免身份泄露至关重要。为了能够发布保留隐私但仍然有用的查询日志,在本文中,我们提出了一种基于语义微聚合的匿名化方法。我们的建议旨在将匿名查询日志的公开风险降至最低,同时尽可能保留其语义。首先,提出了一种将查询映射到从Open Directory Project的结构化类别中提取的形式语义的方法。然后,微聚合方法适用于执行查询日志的语义基础匿名化。为此,提出了适当的语义相似性和语义聚合功能。使用实际的AOL查询日志进行的实验表明,与其他相关工作相比,我们的提案更好地保留了匿名查询日志的实用性,同时还最大程度地降低了披露风险。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号