【24h】

Infinite Scaled Dirichlet Mixture Models for Spam Filtering via Bayesian and Variational Bayes Learning

机译:通过贝叶斯和变分贝叶斯学习的垃圾邮件过滤的无限尺度Dirichlet混合模型

获取原文

摘要

Spam filtering has been the topic of extensive research in the past. Many machine learning approaches have been proposed. In this paper we propose an approach based on nonparametric Bayesian inference via an infinite scaled Dirichlet mixture model. The scaled Dirichlet can be viewed as a flexible generalization of the well-known Dirichlet distribution. Our filtering framework uses both Markov Chain Monte Carlo techniques and a variational Bayes approach for the learning the resulting model. Unlike the majority of previous approaches, that have considered only the textual content of emails, our approach takes into account the visual content (i.e. images) which is largely ignored despite the fact that it is widely used by spammers. Extensive simulations and experiments have been conducted to demonstrate the merits of our framework.
机译:过去,垃圾邮件过滤一直是广泛研究的主题。已经提出了许多机器学习方法。在本文中,我们提出了一种基于非参数贝叶斯推断的方法,该方法是通过无限缩放的Dirichlet混合模型进行的。缩放后的Dirichlet可以看作是众所周知的Dirichlet分布的灵活概括。我们的过滤框架同时使用了马尔可夫链蒙特卡罗技术和变分贝叶斯方法来学习生成的模型。与大多数以前只考虑电子邮件文本内容的方法不同,我们的方法考虑了可视内容(即图像),尽管垃圾邮件发送者广泛使用了可视内容(图像)。已经进行了广泛的模拟和实验,以证明我们框架的优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号