In online advertising, an important quality-control step is to audit advertising images ("creatives") before they appear on publishers' webpages, ensuring that advertisements appear only on webpages where they are appropriate. Assigning the correct sensitive categories (such as alcohol or tobacco) to each creative is one of the most important aspects to get right. If a sensitive creative is displayed on the wrong webpage, it can harm the user's experience and the publisher's reputation, and may have legal implications. To protect against this, humans audit every creative before it is displayed through our ad exchange; this process is costly and time-consuming. This paper explains how we automated sensitive-category detection. To detect whether a creative has any sensitive content, we use a pre-trained deep convolutional neural network (Xception [1]) to process the creative image and merge this representation with the historical distribution of sensitive categories associated with the creative's landing page (the webpage that loads when the ad is clicked, which may also contain sensitive content). The merged representation is then passed through a series of fully connected layers to predict whether the creative belongs to a sensitive category. In offline testing, this model achieves slightly better than human performance (model accuracy 99.92%; human accuracy 99.88%) on a large fraction of creatives (61%), while making 3.5 times fewer mistakes in certain categories for which mistakes are especially costly. These results changed somewhat when the model was deployed at scale in production, where a small modification led to classifying fewer creatives than estimated offline, with approximately the same accuracy (52% classified with 99.87% accuracy).
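The fusion described above can be sketched in minimal form: image features from the convolutional backbone are concatenated with the landing page's historical category distribution, and the result is fed through fully connected layers to produce per-category scores. This is an illustrative sketch only; the feature dimension (2048, Xception's pooled output size), the number of sensitive categories (10), the hidden-layer width, and concatenation as the merge operation are assumptions, not the paper's exact configuration.

```python
import math
import random

random.seed(0)


def dense(x, w, b, activation=None):
    """One fully connected layer: y = activation(W x + b)."""
    y = [sum(wi * xi for wi, xi in zip(row, x)) + bi for row, bi in zip(w, b)]
    if activation == "relu":
        return [max(0.0, v) for v in y]
    if activation == "sigmoid":
        return [1.0 / (1.0 + math.exp(-v)) for v in y]
    return y


def random_layer(n_in, n_out):
    """Random weights standing in for a trained layer."""
    w = [[random.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    b = [0.0] * n_out
    return w, b


# Hypothetical inputs: a stand-in for the Xception image embedding, and the
# landing page's historical distribution over 10 assumed sensitive categories.
image_features = [random.random() for _ in range(2048)]
landing_page_hist = [0.1] * 10

# "Merge" modeled as simple concatenation of the two representations.
fused = image_features + landing_page_hist

# Two fully connected layers: hidden ReLU layer, then sigmoid scores per category.
w1, b1 = random_layer(len(fused), 64)
w2, b2 = random_layer(64, 10)
hidden = dense(fused, w1, b1, activation="relu")
scores = dense(hidden, w2, b2, activation="sigmoid")

print(len(scores))
```

In a real system the weights would of course be learned end to end, and the landing-page distribution would come from historical audit logs rather than a constant vector.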