首页> 外文会议>2012 Information Security for South Africa. >Filtering spam e-mail with Generalized Additive Neural Networks
【24h】

Filtering spam e-mail with Generalized Additive Neural Networks

机译:使用广义加性神经网络过滤垃圾邮件

获取原文
获取原文并翻译 | 示例

摘要

Some of the major security risks associated with spam e-mail are the spreading of computer viruses and the facilitation of phishing exercises. Spam is therefore regarded as one of the prominent security threats in modern organizations. Security controls, such as spam filtering techniques, have become increasingly important to protect information and information assets. In this paper the performance of a Generalized Additive Neural Network on a publicly available e-mail corpus is investigated in the context of statistical spam filtering. The neural network is compared to a Naive Bayesian classifier and a Memory-based technique. Generalized Additive Neural Networks have a number of advantages compared to neural networks in general. An automated construction algorithm performs feature and model selection simultaneously and produces results which can be interpreted by a graphical method. This algorithm is powerful, effective and performs highly accurate compared to other non-linear model selection methods. The paper also considers the impact of different feature set sizes using cost-sensitive measures. These criteria are sensitive to the cost difference between two common types of errors made by filtering systems. Experiments show better performance compared to the Naive Bayes and Memory-based classifiers where legitimate e-mails are assigned the same cost as spams. This result suggests Generalized Additive Neural Networks may be utilized to flag spam e-mails in order to prioritize the reading of messages.
机译:与垃圾邮件相关的一些主要安全风险是计算机病毒的传播和网络钓鱼活动的便利。因此,垃圾邮件被视为现代组织中的主要安全威胁之一。安全控制(例如垃圾邮件过滤技术)对于保护信息和信息资产变得越来越重要。在本文中,在统计垃圾邮件过滤的情况下研究了通用加性神经网络在可公开获得的电子邮件语料库上的性能。将神经网络与朴素贝叶斯分类器和基于内存的技术进行比较。与一般的神经网络相比,广义加性神经网络具有许多优势。自动构造算法同时执行特征和模型选择,并产生可通过图形方法解释的结果。与其他非线性模型选择方法相比,该算法功能强大,有效且执行精度高。本文还使用成本敏感措施来考虑不同功能集大小的影响。这些标准对由过滤系统产生的两种常见错误类型之间的成本差异很敏感。与朴素贝叶斯和基于内存的分类器相比,实验显示出更好的性能,在这些分类器中,为合法电子邮件分配与垃圾邮件相同的费用。该结果表明,可以利用通用加性神经网络来标记垃圾邮件,以便优先阅读邮件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号