首页> 外文会议>IEEE International Conference on Computational Science and Engineering >An Enhanced Genetic Programming Approach for Detecting Unsolicited Emails
【24h】

An Enhanced Genetic Programming Approach for Detecting Unsolicited Emails

机译:一种增强的遗传编程方法,用于检测未经请求的电子邮件

获取原文

摘要

Identification of unsolicited emails (spams) is now a well-recognized research area within text classification. A good email classifier is not only evaluated by performance accuracy but also by the false positive rate. This research presents an Enhanced Genetic Programming (EGP) approach which works by building an ensemble of classifiers for detecting spams. The proposed classifier is tested on the most informative features of two publically available corpuses (Enron and spamassassin) found using Greedy stepwise search method. Thereafter, the proposed ensemble of classifiers is compared with various Machine Learning Classifiers: Genetic Programming (GP), Bayesian, Naive Bayes (NB), J48, Random forest (RF), and SVM. Results of this study indicate that the proposed classifier (EGP) is the best classifier among those compared in terms of performance accuracy as well as false positive rate.
机译:识别未经请求的电子邮件(垃圾邮件)现在是文本分类中的公认研究区域。良好的电子邮件分类器不仅通过性能准确性评估,而且还通过假阳性率来评估。该研究提出了一种增强的遗传编程(EGP)方法,其通过构建用于检测垃圾邮件的分类器的集合来作用。建议的分类器在使用贪婪逐步搜索方法中找到的两个公开可用的语料库(SENON和SPAMASSASIN)的最具信息特征测试。此后,将拟议的分类器集合与各种机器学习分类器进行比较:遗传编程(GP),贝叶斯,幼稚贝叶斯(NB),J48,随机森林(RF)和SVM。该研究的结果表明,所提出的分类器(EGP)是在性能准确度和假阳性率方面比较的最佳分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号