首页> 外文会议>2014 5th International Conference- Confluence The Next Generation Information Technology Summit >A hybrid approach for spam filtering using local concentration based K-Means clustering

【24h】

A hybrid approach for spam filtering using local concentration based K-Means clustering

机译：使用基于局部集中的K-Means聚类的垃圾邮件过滤的混合方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Electronic mail (email) has become an essential element for Internet users. Many studies indicate that day by day numbers of internet users are increasing. As population increasing on the Internet, volume of email traffic is also growing. This entire volume of email consist 80% of unwanted emails. These unwanted emails are known as spam email and referred as unsolicited bulk email (UBE). These emails are sent in bulk to large number of recipients. This increased volume of spam email results a most common problem i.e. maintaining email inbox. Spam Email is major issue for internet community because it causes wastage of resources and also pollutes our environment. To prevent these adverse effects of spam email, spam filtering is essential task. Various researchers have proposed many techniques and algorithms for spam filtering; which focuses on individual parameters of the malicious content. In current scenario spammers are also become intelligent they attack on weak point of filtering system. In this work we divided entire process of filtering in four stages. At first stage we applied string tokenizer for generating terms from incoming message. These tokens are passed to second stage where we applied Information Gain (IG) as term selection strategy. After this we passed selected terms to third stage of filtering. Third stage consist of Local Concentration based Artificial Immune System for feature selection. Newly constructed feature vectors are passed to K-Means clustering algorithm for classification at fourth stage. In support of our work we conducted several experiments and gave a comparative analysis with various existing methods on different parameters.

机译：电子邮件（电子邮件）已成为Internet用户的基本要素。许多研究表明，互联网用户的数量每天都在增加。随着Internet上人口的增长，电子邮件通信量也在增长。电子邮件的全部数量占不需要电子邮件的80％。这些不需要的电子邮件称为垃圾邮件，也称为不请自来的批量电子邮件（UBE）。这些电子邮件将批量发送给大量收件人。垃圾邮件数量的增加导致最常见的问题，即维护电子邮件收件箱。垃圾电子邮件是Internet社区的主要问题，因为它导致资源浪费并污染我们的环境。为了防止垃圾邮件的这些不利影响，垃圾邮件过滤是必不可少的任务。许多研究人员提出了许多垃圾邮件过滤技术和算法。它着重于恶意内容的各个参数。在当前情况下，垃圾邮件发送者也变得很聪明，他们攻击过滤系统的薄弱环节。在这项工作中，我们将整个过滤过程分为四个阶段。在第一阶段，我们应用了字符串标记器，用于根据传入消息生成术语。这些令牌被传递到第二阶段，在该阶段我们应用信息增益（IG）作为术语选择策略。此后，我们将选定的术语传递到过滤的第三阶段。第三阶段包括用于特征选择的基于局部集中的人工免疫系统。新构建的特征向量在第四阶段传递给K-Means聚类算法进行分类。为了支持我们的工作，我们进行了几次实验，并使用各种现有方法对不同参数进行了比较分析。

著录项

来源
《2014 5th International Conference- Confluence The Next Generation Information Technology Summit 》|2014年|194-199|共6页
会议地点 Noida(IN)
作者
Jain Kunal; Agrawal Sanjay;
展开▼
作者单位

Dept of CEA, National Institute of Technical Teachers' Training and Research, Bhopal, India;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Classification algorithms; Feature extraction; Filtering; Immune system; Internet; Unsolicited electronic mail; AIS; Information Gain(IG); K-means Clustering; Legitimate; Spam;

机译：分类算法;特征提取;过滤;免疫系统;互联网;不请自来的电子邮件; AIS;信息增益（IG）; K均值聚类;合法性;垃圾邮件;;

相似文献

外文文献
中文文献
专利

1. A Local-Concentration-Based Feature Extraction Approach for Spam Filtering [J] . Zhu Y., Tan Y. Information Forensics and Security, IEEE Transactions on . 2011 ,第2期

机译：基于局部浓度的垃圾邮件特征提取方法
2. An incremental cluster-based approach to spam filtering [J] . Wen-Feng Hsiao, Te-Min Chang Expert systems with applications . 2008 ,第3期

机译：基于增量群集的垃圾邮件过滤方法
3. An Email Modelling Approach for Neural Network Spam Filtering to Improve Score-based Anti-spam Systems [J] . Yahya Alamlahi, Abdulrahman Muthana International Journal of Computer Network and Information Security . 2018 ,第12期

机译：用于神经网络垃圾邮件过滤的电子邮件建模方法，以改进基于分数的反垃圾邮件系统
4. A hybrid approach for spam filtering using local concentration based K-Means clustering [C] . Jain Kunal, Agrawal Sanjay International Conference- Confluence The Next Generation Information Technology Summit . 2014

机译：基于局部浓度的K均值聚类的垃圾邮件过滤的混合方法
5. An Improved Clustering based Monte Carlo Localization Approach for Cooperative Multi-robot Localization. [D] . Luo, Guanghui. 2011

机译：一种改进的基于聚类的蒙特卡洛协同多机器人协作定位方法。
6. Detection and Localization of Early-Stage Multiple Brain Tumors Using a Hybrid Technique of Patch-Based Processing k-means Clustering and Object Counting [O] . Mohamed Nasor, Walid Obaid 2020

机译：使用基于补丁的处理k均值聚类和对象计数的混合技术对早期多发性脑肿瘤进行检测和定位
7. Clustering-based Spam Image Filtering Considering Fuzziness of the Spam Image [O] . Master Prince 2016

机译：考虑垃圾邮件图像模糊性的基于聚类的垃圾邮件滤波
8. Development of Algorithms for Travel Time-Based Traffic Signal Timing, Phase I: A Hybrid Extended Kalman Filtering Approach for Traffic Density Estimation along Signalized Arterials [R] . Liu, H. X., Di, X. 2010

机译：基于行程时间的交通信号配时算法的发展，第一阶段：基于信号干线的交通密度估计的混合扩展卡尔曼滤波方法

A hybrid approach for spam filtering using local concentration based K-Means clustering

摘要

著录项

相似文献

相关主题

期刊订阅