首页> 外文会议>Information Security for South Africa Conference >Filtering spam e-mail with Generalized Additive Neural Networks
【24h】

Filtering spam e-mail with Generalized Additive Neural Networks

机译:通过广义添加剂神经网络过滤垃圾邮件电子邮件

获取原文

摘要

Some of the major security risks associated with spam e-mail are the spreading of computer viruses and the facilitation of phishing exercises. Spam is therefore regarded as one of the prominent security threats in modern organizations. Security controls, such as spam filtering techniques, have become increasingly important to protect information and information assets. In this paper the performance of a Generalized Additive Neural Network on a publicly available e-mail corpus is investigated in the context of statistical spam filtering. The neural network is compared to a Naive Bayesian classifier and a Memory-based technique. Generalized Additive Neural Networks have a number of advantages compared to neural networks in general. An automated construction algorithm performs feature and model selection simultaneously and produces results which can be interpreted by a graphical method. This algorithm is powerful, effective and performs highly accurate compared to other non-linear model selection methods. The paper also considers the impact of different feature set sizes using cost-sensitive measures. These criteria are sensitive to the cost difference between two common types of errors made by filtering systems. Experiments show better performance compared to the Naive Bayes and Memory-based classifiers where legitimate e-mails are assigned the same cost as spams. This result suggests Generalized Additive Neural Networks may be utilized to flag spam e-mails in order to prioritize the reading of messages.
机译:与垃圾邮件电子邮件相关的一些主要安全风险是计算机病毒的传播以及网络钓鱼练习的便利。因此,垃圾邮件被视为现代组织中突出的安全威胁之一。保护信息和信息资产等安全控制(如垃圾邮件过滤技术)越来越重要。本文在统计垃圾邮件滤波的背景下调查了广义添加剂神经网络在公开可用的电子邮件语料库上的性能。将神经网络与Naive Bayesian分类器和基于存储器的技术进行比较。与神经网络相比,广义添加剂神经网络具有许多优点。自动施工算法同时执行特征和模型选择,并产生可以通过图形方法解释的结果。与其他非线性模型选择方法相比,该算法功能强大,有效,高精度地执行。本文还考虑了使用成本敏感措施的不同特征设定大小的影响。这些标准对通过过滤系统进行的两个常见类型的错误之间的成本差异敏感。与天真贝叶斯和基于内存的分类器相比,实验表现出更好的性能,其中合法的电子邮件被分配与垃圾邮件相同的成本。该结果表明广义添加剂神经网络可以用于标记垃圾邮件电子邮件,以便优先考虑读取消息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号