...
首页> 外文期刊>Expert systems with applications >Enhanced Topic-based Vector Space Model for semantics-aware spam filtering
【24h】

Enhanced Topic-based Vector Space Model for semantics-aware spam filtering

机译:增强的基于主题的向量空间模型,用于语义感知的垃圾邮件过滤

获取原文
获取原文并翻译 | 示例
           

摘要

Spam has become a major issue in computer security because it is a channel for threats such as computer viruses, worms and phishing. More than 85% of received e-mails are spam. Historical approaches to combat these messages including simple techniques such as sender blacklisting or the use of e-mail signatures, are no longer completely reliable. Currently, many solutions feature machine-learning algorithms trained using statistical representations of the terms that usually appear in the e-mails. Still, these methods are merely syntactic and are unable to account for the underlying semantics of terms within the messages. In this paper, we explore the use of semantics in spam filtering by representing e-mails with a recently introduced Information Retrieval model: the enhanced Topic-based Vector Space Model (eTVSM). This model is capable of representing linguistic phenomena using a semantic ontology. Based upon this representation, we apply several well-known machine-learning models and show that the proposed method can detect the internal semantics of spam messages.
机译:垃圾邮件已成为计算机安全的主要问题,因为它是计算机病毒,蠕虫和网络钓鱼等威胁的渠道。超过85%的电子邮件是垃圾邮件。与这些消息作斗争的历史方法,包括诸如发件人黑名单或使用电子邮件签名之类的简单技术,已不再完全可靠。当前,许多解决方案都采用了机器学习算法,这些算法是使用电子邮件中通常出现的术语的统计表示来训练的。但是,这些方法仅是语法上的,无法解释消息中术语的基本语义。在本文中,我们通过使用最近引入的信息检索模型:增强的基于主题的向量空间模型(eTVSM)来表示电子邮件,来探索语义在垃圾邮件过滤中的使用。该模型能够使用语义本体表示语言现象。基于这种表示,我们应用了几种著名的机器学习模型,并表明该方法可以检测垃圾邮件的内部语义。

著录项

  • 来源
    《Expert systems with applications》 |2012年第1期|p.437-444|共8页
  • 作者单位

    Laboratory for Smartness, Semantics and Security (S~3Lab), University of Deusto, Avenida de las Universidades 24,48007 Bilbao, Spain;

    Laboratory for Smartness, Semantics and Security (S~3Lab), University of Deusto, Avenida de las Universidades 24,48007 Bilbao, Spain;

    Laboratory for Smartness, Semantics and Security (S~3Lab), University of Deusto, Avenida de las Universidades 24,48007 Bilbao, Spain;

    Laboratory for Smartness, Semantics and Security (S~3Lab), University of Deusto, Avenida de las Universidades 24,48007 Bilbao, Spain;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    spam detection; information retrieval; semantics; computer security; machine-learning;

    机译:垃圾邮件检测;信息检索;语义计算机安全;机器学习;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号