A comparative study for content-based dynamic spam classification using four machine learning algorithms

Bo Yu; Zong-ben Xu

Abstract

The growth in the number of email users has led to a dramatic increase in spam email over the past few years. This paper applies four machine learning algorithms to spam classification: Naive Bayesian (NB), neural network (NN), support vector machine (SVM), and relevance vector machine (RVM). The algorithms are evaluated empirically on benchmark spam filtering corpora, with experiments run across different training set sizes and extracted feature sizes. The results show that the NN classifier is unsuitable for use on its own as a spam rejection tool. In general, the SVM and RVM classifiers clearly outperform the NB classifier. Compared with SVM, RVM provides similar classification results with fewer relevance vectors and a much faster testing time. Despite its slower learning procedure, RVM is more suitable than SVM for spam classification in applications that require low complexity.

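As a rough illustration of what content-based classification with one of the four algorithms can look like, here is a minimal multinomial Naive Bayes sketch with add-one (Laplace) smoothing. It is not the authors' implementation: the toy corpus `TRAIN`, the whitespace tokenizer, and the function names `tokenize`, `train_nb`, and `predict_nb` are all assumptions made for this example, and the paper's benchmark corpora and feature extraction are not reproduced.

```python
import math
from collections import Counter

# Hypothetical toy corpus; the paper uses benchmark spam corpora instead.
TRAIN = [
    ("win cash prize now", "spam"),
    ("cheap pills win money", "spam"),
    ("claim your free prize", "spam"),
    ("meeting agenda for monday", "ham"),
    ("lunch with the project team", "ham"),
    ("please review the project report", "ham"),
]


def tokenize(text):
    """Lowercase and split on whitespace (an assumed, very simple tokenizer)."""
    return text.lower().split()


def train_nb(examples):
    """Count words per class and documents per class."""
    word_counts = {}
    class_counts = Counter()
    for text, label in examples:
        class_counts[label] += 1
        word_counts.setdefault(label, Counter()).update(tokenize(text))
    vocab = set()
    for counts in word_counts.values():
        vocab.update(counts)
    return word_counts, class_counts, vocab


def predict_nb(model, text):
    """Return the class with the highest log posterior score."""
    word_counts, class_counts, vocab = model
    total_docs = sum(class_counts.values())
    scores = {}
    for label, doc_count in class_counts.items():
        score = math.log(doc_count / total_docs)  # log prior
        total_words = sum(word_counts[label].values())
        for word in tokenize(text):
            if word not in vocab:
                continue  # ignore words never seen in training
            # Add-one smoothing keeps unseen (class, word) pairs non-zero.
            numerator = word_counts[label][word] + 1
            score += math.log(numerator / (total_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)


model = train_nb(TRAIN)
print(predict_nb(model, "free cash prize"))
print(predict_nb(model, "project meeting on monday"))
```

Varying the number of examples passed to `train_nb`, or restricting `vocab` to a fixed number of words, would mimic in miniature the training set size and feature size sweeps that the abstract describes.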

Bibliographic details

Source
Knowledge-Based Systems, 2008, Issue 4, pp. 355-362 (8 pages)
Authors
Bo Yu; Zong-ben Xu
Affiliation

School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China

Indexing information
Original format: PDF
Language: English
Chinese Library Classification: Fundamental theory of automation
Keywords
spam classification; naive Bayesian; neural network; support vector machine; relevance vector machine

