...
首页> 外文期刊>Neurocomputing >Spam filtering using semantic similarity approach and adaptive BPNN
【24h】

Spam filtering using semantic similarity approach and adaptive BPNN

机译:使用语义相似度方法和自适应BPNN的垃圾邮件过滤

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes a novel approach for spam filtering based on various semantic similarity measures and an adaptive back propagation neural network (ABPNN). Semantic similarity approach is a promising avenue that addresses the problems for keyword based spam filtering model. In this paper, we propose a new method that integrates three kinds of semantic similarity approaches for spam filtering as a case study of data mining application. First, to construct a latent semantic feature space from training data with a statistical method. Second, to build a corpus based thesaurus by extracting the relationship between words based on its co-occurrence in the documents. Third, to combine the latent semantic feature space with the corpus based thesaurus. Back propagation neural network is one of the efficient approaches for classification. However, the traditional BPNN has the problems of slow learning and easy to trap into a local minimum. In this paper, we adopt an adaptive algorithm to improve the traditional BPNN that can overcome these problems. To investigate the effectiveness of our methods, we conduct extensive experiments on ling-spam, PU1 and PU3 data sets. Experimental results show that the proposed system is able to achieve higher performance, especially for the combination of the hybrid semantic similarity approach and the adaptive back propagation neural network.
机译:本文提出了一种基于各种语义相似性度量和自适应反向传播神经网络(ABPNN)的垃圾邮件过滤新方法。语义相似性方法是一个有前途的途径,可以解决基于关键字的垃圾邮件过滤模型的问题。在本文中,我们提出了一种新方法,该方法结合了三种语义相似性方法来进行垃圾邮件过滤,以数据挖掘应用为例。首先,使用统计方法从训练数据中构造潜在的语义特征空间。其次,通过在文档中基于单词的共现来提取单词之间的关系,从而构建基于词库的词库。第三,将潜在语义特征空间与基于语料库的词库结合起来。反向传播神经网络是有效的分类方法之一。但是,传统的BPNN具有学习速度慢和容易陷入局部最小值的问题。在本文中,我们采用一种自适应算法来改进传统的BPNN,以克服这些问题。为了研究我们方法的有效性,我们对垃圾邮件,PU1和PU3数据集进行了广泛的实验。实验结果表明,提出的系统能够达到较高的性能,特别是混合语义相似度方法和自适应反向传播神经网络的结合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号