首页> 外文会议>International Conference on Intelligent Sustainable Systems >An Effective Concept Drift Detection Technique with Kernel Extreme Learning Machine for Email Spam Filtering
【24h】

An Effective Concept Drift Detection Technique with Kernel Extreme Learning Machine for Email Spam Filtering

机译:电子邮件垃圾邮件过滤内核极端学习机的有效概念漂移检测技术

获取原文

摘要

The increase in the number of undesirable emails named spam has posed a major requirement to develop a highly dependent and robust antispam filters. This paper presents a novel email spam filtering technique with the capability of adapting with the dynamic environment. Concept drift detector attempts to determine the position of the concept drift in large data stream for replacing the baseline learner next to the modifications in the data distribution and therefore enhances accuracy. The proposed method detects the concept drift depending upon the computation of variation in the email content distribution using Statistical Test of Equal Proportions (STEPD) technique. The STEPD is a simpler commonly available model that identifies the concept drift with respect to a hypothesis test among two proportions. The SPEPD technique is used to determine the criteria of the concept drift for all unknown emails that assist the filtering technique in the recognition of the occurrence of the spam. In addition, the kernel extreme learning machine (KELM) based classification model is applied to classify the instances into two class labels namely spam and non-spam correspondingly. The experimental results of the STEPD-KELM model are tested against Enron dataset and the results are examined interms of distinct aspects. The experimental values indicated that the STEPD-KELM model has resulted to a maximum precision of 93.78%, recall of 96.54%, and accuracy of 95.33%.
机译:名为SPAM的不良电子邮件数量的增加构成了开发高度依赖性和强大的抗驱动器过滤器的主要要求。本文提出了一种新型电子邮件垃圾邮件过滤技术,具有适应动态环境的能力。概念漂移探测器试图确定大数据流中概念漂移的位置,以将基线学习者替换为数据分布的修改旁边,因此提高了准确性。所提出的方法根据使用相同比例(STEPD)技术的统计测试的电子邮件内容分发的变化计算来检测概念漂移。 STEPD是一种更简单的常用模型,其识别关于两个比例之间的假设测试的概念漂移。 SPEPD技术用于确定所有未知电子邮件的概念漂移的标准,可以帮助过滤技术在识别垃圾邮件的情况下。此外,应用基于内核的基于学习机(KELM)的分类模型,将实例分为两类标签,即相应的垃圾邮件和非垃圾邮件。 STEPD-KELM模型的实验结果针对enron数据集进行了测试,结果被检查了不同方面的互联网。实验值表明,Stepd-Kelm模型导致最高精度为93.78%,召回的96.54%,准确度为95.33%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号