首页> 外文会议>European Symposium on Research in Computer Security >A Machine Learning Approach for Detecting Third-Party Trackers on the Web
【24h】

A Machine Learning Approach for Detecting Third-Party Trackers on the Web

机译:一种机器学习方法,用于检测网络上的第三方跟踪器

获取原文

摘要

Nowadays, privacy violation caused by third-party tracking has become a serious problem and yet the most effective method to defend against third-party tracking is based on blacklists. Such method highly depends on the quality of the blacklist database, whose records need to be updated frequently. However, most records are curated manually and very difficult to maintain. To efficiently generate blacklists, we propose a system with high accuracy, named DMTrackerDetector, to detect third-party trackers automatically. Existing methods to detect online tracking have two shortcomings. Firstly, they treat first-party tracking and third-party tracking the same. Secondly, they always focus on a certain way of tracking and can only detect limited trackers. Since anti-tracking technology based on blacklists highly depends on the coverage of the blacklist database, these methods cannot generate high-quality blacklists. To solve these problems, we firstly use the structural hole theory to preserve first-party trackers, and only detect third-party trackers based on supervised machine learning by exploiting the fact that trackers and non-trackers always call different JavaScript APIs for different purposes. The results show that 97.8% of the third-party trackers in our test set can be correctly detected. The blacklist generated by our system not only covers almost all records in the Ghostery list (one of the most popular anti-tracking tools), but also detects 35 unrevealed trackers.
机译:如今,第三方追踪造成的隐私违规已成为一个严重的问题,然而,抵御第三方跟踪的最有效的方法是基于黑名单。这种方法高度取决于黑名单数据库的质量,其记录需要经常更新。但是,大多数记录都是手动策划,非常难以维护。为了有效地生成黑名单,我们提出了一个具有高精度,名为DMTrackerDetector的系统,自动检测第三方跟踪器。检测在线跟踪的现有方法有两个缺点。首先,他们对待一方追踪和第三方跟踪相同。其次,它们总是专注于某种跟踪方式,只能检测有限的跟踪器。由于基于黑名单的反跟踪技术高度取决于黑名单数据库的覆盖范围,因此这些方法无法生成高质量的黑名单。为了解决这些问题,我们首先使用结构孔理论保留一方的追踪器,只能通过利用追踪器和非跟踪器始终调用不同的JavaScript API来基于监督机器学习来检测第三方跟踪器。结果表明,可以正确地检测到我们的测试集中的第三方跟踪器的97.8%。我们的系统生成的黑名单不仅涵盖了Ghostery列表中的几乎所有记录(最受欢迎的反跟踪工具之一),而且还检测到35个未伪造的跟踪器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号