首页> 外文会议>ICETC 2010;International Conference on Education Technology and Computer >Using DBSCAN Clustering Algorithm in Spam Identifying
【24h】

Using DBSCAN Clustering Algorithm in Spam Identifying

机译:在垃圾邮件识别中使用DBSCAN聚类算法

获取原文

摘要

In the field of internet research, anti-spam mechanism has become a focus currently. The identification of spam plays an important role in current anti-spam mechanism. In order to identify spam efficiently, it usually needs to be able to identify similar emails, i.e. spam clustering. Using the present methods to cluster the emails, many similar emails will be clustered into several groups. For improving the accuracy of spam identification, we present a new clustering method which is based on the DBSCAN clustering algorithm and nilsimsa digest algorithm. Using this method, all emails identified similar artificially are clustered together. The result of the simulation shows that the clustering method based on DBSCAN and nilsimsa performs with higher clustering accuracy than the other clustering methods. From the simulation result, we can also conclude that the shape of the spam digest subspace is irregular.
机译:在互联网研究领域,反垃圾邮件机制已成为当前的关注焦点。垃圾邮件的识别在当前的反垃圾邮件机制中起着重要的作用。为了有效地识别垃圾邮件,通常需要能够识别类似的电子邮件,即垃圾邮件群集。使用本发明的方法将电子邮件聚类,许多相似的电子邮件将被聚类为几个组。为了提高垃圾邮件识别的准确性,我们提出了一种新的基于DBSCAN聚类算法和nilsimsa摘要算法的聚类方法。使用此方法,可以将所有人为识别的相似电子邮件合并在一起。仿真结果表明,基于DBSCAN和nilsimsa的聚类方法比其他聚类方法具有更高的聚类精度。从模拟结果,我们还可以得出结论,垃圾邮件摘要子空间的形状是不规则的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号