Reversing the effects of tokenisation attacks against content-based spam filters

Igor Santos; Carlos Laorden; Borja Sanz; Pablo G. Bringas

首页> 外文期刊>International Journal of Security and Networks >Reversing the effects of tokenisation attacks against content-based spam filters

【24h】

Reversing the effects of tokenisation attacks against content-based spam filters

机译：扭转针对基于内容的垃圾邮件过滤器的令牌化攻击的影响

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

More than 85% of the received emails are spam. Many current solutions feature machine-learning algorithms trained using statistical representations of the terms that most commonly appear in such emails. However, there are attacks that can subvert the filtering capabilities of these methods. Tokenisation attacks insert characters within words, subverting these methods. In this paper, we introduce a new method that reverses the effects of tokenisation attacks. Our method processes emails iteratively by considering possible words, starting from the first token and compares the word candidates with a common dictionary to which spam words have been previously added. We provide an empirical study of how tokenisation attacks affect the filtering capability of a Bayesian classifier and we show that our method can reverse the effects of tokenisation attacks.

机译：超过85％的电子邮件是垃圾邮件。当前许多解决方案均采用机器学习算法进行训练，这些算法是使用此类电子邮件中最常见的术语的统计表示进行训练的。但是，有些攻击可能会破坏这些方法的过滤功能。令牌化攻击在单词中插入字符，从而颠覆了这些方法。在本文中，我们介绍了一种新的方法，可以逆转令牌化攻击的影响。我们的方法从第一个令牌开始，通过考虑可能的单词来迭代处理电子邮件，并将候选单词与之前已添加垃圾邮件单词的公共词典进行比较。我们提供了关于令牌化攻击如何影响贝叶斯分类器的过滤能力的经验研究，并且我们证明了我们的方法可以逆转令牌化攻击的影响。

著录项

来源
《International Journal of Security and Networks》 |2013年第2期|共11页
作者
Igor Santos; Carlos Laorden; Borja Sanz; Pablo G. Bringas;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Spam; Security; Attacks; Tokenisation;

机译：垃圾邮件;安全性;攻击;令牌化;

相似文献

外文文献
中文文献
专利

1. Reversing the effects of tokenisation attacks against content-based spam filters [J] . Igor Santos, Carlos Laorden, Borja Sanz, International Journal of Security and Networks . 2013,第2期

机译：扭转针对基于内容的垃圾邮件过滤器的令牌化攻击的影响
2. SDAI: An integral evaluation methodology for content-based spam filtering models [J] . Noemi Perez-Diaz, David Ruano-Ordas, Florentino Fdez-Riverola, Expert Systems with Application . 2012,第16期

机译：SDAI：基于内容的垃圾邮件过滤模型的整体评估方法
3. An Overview of Content-Based Spam Filtering Techniques [J] . A. Khorsi Informatica: An International Journal of Computing and Informatics . 2007,第3期

机译：基于内容的垃圾邮件过滤技术概述
4. JURD: Joiner of Un-Readable Documents to reverse tokenization attacks to content-based spam filters [C] . Santos Igor, Laorden Carlos, Sanz Borja, IEEE Consumer Communications and Networking Conference . 2013

机译：JURD：不可读文档的合并程序，用于将基于令牌的攻击逆转为基于内容的垃圾邮件过滤器
5. A multiple instance learning strategy for combating adversarial good word attacks on statistical spam filters [D] . Jorgensen, Zachary D. 2008

机译：一种多实例学习策略，可对抗统计垃圾邮件过滤器上的对抗好词攻击
6. Effects of reverse deployment of cone-shaped vena cava filter on improvements in hemodynamic performance in vena cava [O] . Ying Chen, Zaipin Xu, Xiaoyan Deng, 2021

机译：锥形腔静脉滤波器对静脉脉络膜血流动力学性能改善的影响
7. Content-based Approach for Vietnamese Spam SMS Filtering [O] . Pham, Thai-Hoang, Le-Hong, Phuong 2017

机译：基于内容的越南垃圾邮件短信过滤方法
8. Machine Learning in the Presence of an Adversary: Attacking and Defending the SpamBayes Spam Filter [R] . Saini, U. 2008

机译：在对手面前进行机器学习：攻击和防御spamBayes垃圾邮件过滤器

Reversing the effects of tokenisation attacks against content-based spam filters

摘要

著录项

相似文献

相关主题

期刊订阅