...
首页> 外文期刊>Research journal of applied science, engineering and technology >A Novel Feature Selection Based on One-Way ANOVA F-Test for E-Mail Spam Classification
【24h】

A Novel Feature Selection Based on One-Way ANOVA F-Test for E-Mail Spam Classification

机译:基于单向ANOVA F检验的电子邮件垃圾邮件分类新特征选择

获取原文
           

摘要

Spam is commonly defined as unwanted e-mails and it became a global threat against e-mail users. Although, Support Vector Machine (SVM) has been commonly used in e-mail spam classification, yet the problem of high data dimensionality of the feature space due to the massive number of e-mail dataset and features still exist. To improve the limitation of SVM, reduce the computational complexity (efficiency) and enhancing the classification accuracy (effectiveness). In this study, feature selection based on one-way ANOVA F-test statistics scheme was applied to determine the most important features contributing to e-mail spam classification. This feature selection based on one-way ANOVA F-test is used to reduce the high data dimensionality of the feature space before the classification process. The experiment of the proposed scheme was carried out using spam base well-known benchmarking dataset to evaluate the feasibility of the proposed method. The comparison is achieved for different datasets, categorization algorithm and success measures. In addition, experimental results on spam base English datasets showed that the enhanced SVM (FSSVM) significantly outperforms SVM and many other recent spam classification methods for English dataset in terms of computational complexity and dimension reduction.
机译:垃圾邮件通常被定义为不需要的电子邮件,它已成为对电子邮件用户的全球威胁。尽管支持向量机(SVM)已普遍用于电子邮件垃圾邮件分类,但是由于电子邮件数据集和特征的数量众多,特征空间的数据维度高的问题仍然存在。为了改善支持向量机的局限性,降低计算复杂度(效率)并提高分类准确性(有效性)。在这项研究中,基于单向方差分析F检验统计方案的特征选择被应用于确定有助于垃圾邮件分类的最重要特征。基于单向ANOVA F检验的特征选择用于减少分类过程之前特征空间的高数据维数。使用垃圾邮件基础众所周知的基准数据集进行了该方案的实验,以评估该方法的可行性。比较了不同的数据集,分类算法和成功措施。此外,基于垃圾邮件的英语数据集的实验结果表明,在计算复杂度和降维方面,增强的SVM(FSSVM)明显优于SVM和许多其他最近针对英语数据集的垃圾邮件分类方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号