首页> 外文期刊>Scientific programming >QuPiD Attack: Machine Learning-Based Privacy Quantification Mechanism for PIR Protocols in Health-Related Web Search
【24h】

QuPiD Attack: Machine Learning-Based Privacy Quantification Mechanism for PIR Protocols in Health-Related Web Search

机译:Qupid攻击:健康相关网站搜索中的PIR协议的基于机器的隐私量化机制

获取原文
           

摘要

With the advancement in ICT, web search engines have become a preferred source to find health-related information published over the Internet. Google alone receives more than one billion health-related queries on a daily basis. However, in order to provide the results most relevant to the user, WSEs maintain the users’ profiles. These profiles may contain private and sensitive information such as the user’s health condition, disease status, and others. Health-related queries contain privacy-sensitive information that may infringe user’s privacy, as the identity of a user is exposed and may be misused by the WSE and third parties. This raises serious concerns since the identity of a user is exposed and may be misused by third parties. One well-known solution to preserve privacy involves issuing the queries via peer-to-peer private information retrieval protocol, such as useless user profile (UUP), thereby hiding the user’s identity from the WSE. This paper investigates the level of protection offered by UUP. For this purpose, we present QuPiD (query profile distance) attack: a machine learning-based attack that evaluates the effectiveness of UUP in privacy protection. QuPiD attack determines the distance between the user’s profile (web search history) and upcoming query using our proposed novel feature vector. The experiments were conducted using ten classification algorithms belonging to the tree-based, rule-based, lazy learner, metaheuristic, and Bayesian families for the sake of comparison. Furthermore, two subsets of an America Online dataset (noisy and clean datasets) were used for experimentation. The results show that the proposed QuPiD attack associates more than 70% queries to the correct user with a precision of over 72% for the clean dataset, while for the noisy dataset, the proposed QuPiD attack associates more than 40% queries to the correct user with 70% precision.
机译:随着ICT的进步,Web搜索引擎已成为在互联网上发布的健康相关信息的首选来源。 Google单独每天收到超过10亿的健康有关的查询。但是,为了提供与用户最相关的结果,WSES维护用户的配置文件。这些简档可能包含私人和敏感的信息,例如用户的健康状况,疾病状态和其他信息。与健康有关的查询包含可能侵犯用户隐私的隐私敏感信息,因为用户的身份暴露,并且可能被WSE和第三方滥用。由于用户的身份暴露并且可能被第三方滥用,这提高了严重问题。一种众所周知的保护隐私的解决方案涉及通过对等私人信息检索协议来发出查询,例如无用的用户简档(UUP),从而掩盖了来自WSE的用户的身份。本文调查了UUP提供的保护水平。为此目的,我们呈现Qupid(查询配置文件距离)攻击:基于机器的基于机器的攻击,评估UUP在隐私保护中的有效性。 QUPID攻击确定用户的个人资料(Web搜索历史记录)与即将使用我们所提出的新颖特征向量的查询之间的距离。该实验是使用属于基于树,规则,懒惰的学习者,成群质主义和贝叶斯家族的十个分类算法进行的。此外,美国在线数据集(嘈杂和清洁数据集)的两个子集用于实验。结果表明,所提出的Qupid攻击将超过70%的查询与SPEIL DataSet的精度超过72%以上超过72%,而对于嘈杂的数据集,所提出的Qupid攻击将为正确的用户关联超过40%的查询精度70%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号