首页> 外文期刊>IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences >O-means: An Optimized Clustering Method for Analyzing Spam Based Attacks
【24h】

O-means: An Optimized Clustering Method for Analyzing Spam Based Attacks

机译:O均值:一种用于分析基于垃圾邮件的攻击的优化聚类方法

获取原文
获取原文并翻译 | 示例
           

摘要

In recent years, the number of spam emails has been dramatically increasing and spam is recognized as a serious internet threat. Most recent spam emails are being sent by bots which often operate with others in the form of a botnet, and skillful spammers try to conceal their activities from spam analyzers and spam detection technology. In addition, most spam messages contain URLs that lure spam receivers to malicious Web servers for the purpose of carrying out various cyber attacks such as malware infection, phishing attacks, etc. In order to cope with spam based attacks, there have been many efforts made towards the clustering of spam emails based on similarities between them. The spam clusters obtained from the clustering of spam emails can be used to identify the infrastructure of spam sending systems and malicious Web servers, and how they are grouped and correlate with each other, and to minimize the time needed for analyzing Web pages. Therefore, it is very important to improve the accuracy of the spam clustering as much as possible so as to analyze spam based attacks more accurately. In this paper, we present an optimized spam clustering method, called O-means, based on the K-means clustering method, which is one of the most widely used clustering methods. By examining three weeks of spam gathered in our SMTP server, we observed that the accuracy of the O-means clustering method is about 87% which is superior to the previous clustering methods. In addition, we define 12 statistical features to compare similarity between spam emails, and we determined a set of optimized features which makes the O-means clustering method more effective.
机译:近年来,垃圾邮件的数量急剧增加,垃圾邮件被认为是严重的互联网威胁。最新的垃圾邮件是由僵尸程序发送的,这些僵尸程序通常以僵尸网络的形式与他人合作,熟练的垃圾邮件发送者试图从垃圾邮件分析器和垃圾邮件检测技术中隐藏其活动。另外,大多数垃圾邮件都包含URL,这些URL会将垃圾邮件接收者引诱到恶意Web服务器,以进行各种网络攻击,例如恶意软件感染,网络钓鱼攻击等。为了应对基于垃圾邮件的攻击,已经做了很多努力。基于垃圾邮件之间的相似性来聚类。从垃圾邮件群集中获得的垃圾邮件群集可用于标识垃圾邮件发送系统和恶意Web服务器的基础结构,以及它们如何进行分组和相互关联,并最大程度地减少分析Web页面所需的时间。因此,尽可能提高垃圾邮件群集的准确性以更准确地分析基于垃圾邮件的攻击非常重要。在本文中,我们基于K-means聚类方法提出了一种优化的垃圾邮件聚类方法,称为O-means,它是使用最广泛的聚类方法之一。通过检查在我们的SMTP服务器中收集的三周垃圾邮件,我们观察到O-means聚类方法的准确性约为87%,这比以前的聚类方法要好。此外,我们定义了12个统计功能以比较垃圾邮件之间的相似性,并确定了一组优化的功能,这些功能使O-means聚类方法更加有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号