An Effective Concept Drift Detection Technique with Kernel Extreme Learning Machine for Email Spam Filtering

机译：电子邮件垃圾邮件过滤内核极端学习机的有效概念漂移检测技术

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The increase in the number of undesirable emails named spam has posed a major requirement to develop a highly dependent and robust antispam filters. This paper presents a novel email spam filtering technique with the capability of adapting with the dynamic environment. Concept drift detector attempts to determine the position of the concept drift in large data stream for replacing the baseline learner next to the modifications in the data distribution and therefore enhances accuracy. The proposed method detects the concept drift depending upon the computation of variation in the email content distribution using Statistical Test of Equal Proportions (STEPD) technique. The STEPD is a simpler commonly available model that identifies the concept drift with respect to a hypothesis test among two proportions. The SPEPD technique is used to determine the criteria of the concept drift for all unknown emails that assist the filtering technique in the recognition of the occurrence of the spam. In addition, the kernel extreme learning machine (KELM) based classification model is applied to classify the instances into two class labels namely spam and non-spam correspondingly. The experimental results of the STEPD-KELM model are tested against Enron dataset and the results are examined interms of distinct aspects. The experimental values indicated that the STEPD-KELM model has resulted to a maximum precision of 93.78%, recall of 96.54%, and accuracy of 95.33%.

机译：名为SPAM的不良电子邮件数量的增加构成了开发高度依赖性和强大的抗驱动器过滤器的主要要求。本文提出了一种新型电子邮件垃圾邮件过滤技术，具有适应动态环境的能力。概念漂移探测器试图确定大数据流中概念漂移的位置，以将基线学习者替换为数据分布的修改旁边，因此提高了准确性。所提出的方法根据使用相同比例（STEPD）技术的统计测试的电子邮件内容分发的变化计算来检测概念漂移。 STEPD是一种更简单的常用模型，其识别关于两个比例之间的假设测试的概念漂移。 SPEPD技术用于确定所有未知电子邮件的概念漂移的标准，可以帮助过滤技术在识别垃圾邮件的情况下。此外，应用基于内核的基于学习机（KELM）的分类模型，将实例分为两类标签，即相应的垃圾邮件和非垃圾邮件。 STEPD-KELM模型的实验结果针对enron数据集进行了测试，结果被检查了不同方面的互联网。实验值表明，Stepd-Kelm模型导致最高精度为93.78％，召回的96.54％，准确度为95.33％。

著录项

来源
《International Conference on Intelligent Sustainable Systems》|2020年|774-779|共6页
会议地点
作者
S Priya; R Annie Uthra;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Filtering; Unsolicited e-mail; Training; Predictive models; Kernel; Data models; Adaptation models;

机译：过滤;未经请求的电子邮件;培训;预测模型;内核;数据模型;适应模型;

相似文献

外文文献
中文文献
专利

1. 一种新型异构集成极端学习机模型及其软测量应用 [J] . 马宁, 董泽东南大学学报（英文版） . 2020,第001期
2. Meta-cognitive recurrent kernel online sequential extreme learning machine with kernel adaptive filter for concept drift handling [J] . Zongying Liu, Chu Kiong Loo, Kitsuchart Pasupa, Engineering Applications of Artificial Intelligence . 2020,第Feba期

机译：元认知递归内核在线顺序极限学习机，带有内核自适应滤波器，用于概念漂移处理
3. Email Spam Filtering using Supervised Machine Learning Techniques [J] . V.Christina, S.Karpagavalli, G.Suganya International Journal on Computer Science and Engineering . 2010,第9期

机译：使用监督机器学习技术的电子邮件垃圾邮件过滤
4. Email Spam Filtering using Supervised Machine Learning Techniques [J] . V.Christina, S.Karpagavalli, G.Suganya International Journal on Computer Science and Engineering . 2010,第9期

机译：使用监督机器学习技术的电子邮件垃圾邮件过滤
5. Content-based concept drift detection for Email spam filtering [C] . Hayat M.Z., Basiri J., Seyedhossein L., 2010 5th International Symposium on Telecommunications . 2010

机译：用于电子邮件垃圾邮件过滤的基于内容的概念漂移检测
6. An evaluation of machine learning techniques for enterprise spam filters [D] . Tuttle, Andrew 2004

机译：对企业垃圾邮件过滤器的机器学习技术的评估
7. Machine learning for email spam filtering: review approaches and open research problems [O] . Emmanuel Gbenga Dada, Joseph Stephen Bassi, Haruna Chiroma, 2019

机译：用于电子邮件垃圾邮件过滤的机器学习：评论方法和公开研究问题
8. A Survey of Learning-Based Techniques of Email Spam Filtering [O] . Enrico Blanzieri, Anton Bryl 2007

机译：基于学习的电子邮件垃圾邮件过滤技术概述

An Effective Concept Drift Detection Technique with Kernel Extreme Learning Machine for Email Spam Filtering

摘要

著录项

相似文献

相关主题

期刊订阅