Filtering image-based spam using multifractal analysis and active learning feedback-driven semi-supervised support vector machine

机译：使用多分析分析和主动学习反馈驱动的半监控支持向量机过滤基于图像的垃圾邮件

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Traditional anti-spam technologies can't block image-based spam because spammers employ a variety of image creation and randomization algorithms to make the message fully legible by the human eye but undistinguishable by the most anti-spam engines. In this paper we propose a novel composite method to filter image-based spam accurately and effectively, which can be easily implemented as a plug-in in SpamAssassin. Our method takes advantage of the two natures of image-based spams: large quantity, similarity and character variability. For the first nature, we use rules of SpamAssassin to detect the emails characteristic. If a new email has been identified as spam by the rules, it will be blocked. Otherwise, image-based mail will be captured by the plug-in. For the second nature,the plug-in will use multifractal analysis in multi-orientation wavelet pyramid algorithm to get image-based email texture descriptor which has strong invariance to many factors, use a hybrid filter-wrapper feature subset selection algorithm based on particle swarm optimization to reduce some redundant or irrelevant features in the texture descriptor, and use a semi-supervised support vector machines classification algorithm to detect whether an email is ham or spam, then use active learning clustering to get the most representative emails for relabeling through user feedback. The relabeled emails by users feedback and the unlabeled suspect spams by SVM will be used to retrain the classification for improving accuracy of spam filter. The experimental results demonstrate that our method is of high efficiency, high accuracy and low false positive rate. The accuracy will be improved and the false positive rate will be reduced along with more and more retraining. So, the method is fit especially for an adversarial learning and processing like spam filtering.

机译：传统的反垃圾邮件技术无法阻止基于图像的垃圾邮件，因为垃圾邮件发送器采用各种图像创建和随机化算法，使得人眼完全清晰清晰地清晰，但由最具反垃圾邮件发动机无法区分。在本文中，我们提出了一种新颖的复合方法，可以精确且有效地过滤基于图像的垃圾邮件，这可以很容易地实现为蜘蛛类中的插件。我们的方法利用了基于图像的两种基于图像的垃圾邮件：大量，相似性和字符变异性。对于第一个性质，我们使用SpamAssass的规则来检测电子邮件特征。如果已将新电子邮件识别为规则的垃圾邮件，则会被阻止。否则，将通过插件捕获基于图像的邮件。对于第二种性质，插件将在多向小波金字塔算法中使用多重分析来获取基于图像的电子邮件纹理描述符，这具有强大的许多因素的不变性，请使用基于粒子群的混合滤波器包装特征子集选择算法优化，以减少纹理描述符中的一些冗余或无关的功能，并使用半监控的支持向量机分类算法来检测电子邮件是否是火腿或垃圾邮件，然后使用主动学习聚类来获取最多代表性的电子邮件，以通过用户反馈重新标记。用户反馈和未标记的SVM的重新标记的电子邮件将用于恢复提高垃圾邮件过滤器精度的分类。实验结果表明，我们的方法具有高效率，高精度和低误率。准确性将得到改善，并且越来越多的效果将减少假阳性率。因此，该方法尤其适用于对垃圾邮件过滤等对抗的学习和处理。

著录项

来源
《IEEE International Conference on Computer-Aided Industrial Design Conceptual Design》|2014年||共5页
会议地点
作者
Jian Zhong; YiLu Zhou; Wei Deng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Active Learning Cluseting; Feedback-Driven Semi-Supervised Support Vector Machine; Image-Based Spam; Multifractal Analysis;

机译：主动学习CLUSETING;反馈驱动的半监控支持向量机;基于图像的垃圾邮件;多法分析;

相似文献

外文文献
中文文献
专利

1. Semi-supervised learning combining transductive support vector machine with active learning [J] . Wang Xibin, Wen Junhao, Alam Shafiq, Neurocomputing . 2016,第JANa15PTa3期

机译：半监督学习，将支持向量机与主动学习相结合
2. Semi-supervised active learning for support vector machines: A novel approach that exploits structure information in data [J] . Calma Adrian, Reitmaier Tobias, Sick Bernhard Information Sciences: An International Journal . 2018,第期

机译：用于支持向量机的半监督主动学习：一种利用数据中结构信息的新方法
3. Mobile SMS Spam Filtering for Nepali Text Using Na?ve Bayesian and Support Vector Machine [J] . Tej Bahadur Shahi, Abhimanu Yadav International Journal of Intelligence Science . 2014,第1期

机译：使用朴素贝叶斯和支持向量机对尼泊尔文本进行移动SMS垃圾邮件过滤
4. Filtering image-based spam using multifractal analysis and active learning feedback-driven semi-supervised support vector machine [C] . Jian Zhong, YiLu Zhou, Wei Deng IEEE International Conference on Computer-Aided Industrial Design Conceptual Design . 2014

机译：使用多分析分析和主动学习反馈驱动的半监控支持向量机过滤基于图像的垃圾邮件
5. On email spam filtering using support vector machine. [D] . Amayri, Ola. 2009

机译：在使用支持向量机的电子邮件垃圾邮件过滤中。
6. Machine learning for email spam filtering: review approaches and open research problems [O] . Emmanuel Gbenga Dada, Joseph Stephen Bassi, Haruna Chiroma, 2019

机译：用于电子邮件垃圾邮件过滤的机器学习：评论方法和公开研究问题
7. Semi-supervised learning combining transductive support vector machine with active learning [O] . Boli Lu, Xibin Wang 2015

机译：半监督学习结合转换支持向量机与主动学习

Filtering image-based spam using multifractal analysis and active learning feedback-driven semi-supervised support vector machine

摘要

著录项

相似文献

相关主题

期刊订阅