【24h】

Spam Detection Using Statistical Theorem

机译:使用统计定理垃圾邮件检测

获取原文

摘要

Bayesian filtering is an anti-spam algorithm which is designed to tackle spam dealing with probabilities. It is named after Bayes' Theorem of statistics, which is used to calculate the probability whether a message is a spam or not. This filter works efficiently by comparing e-mail content (phrases or tokens) against stored databases. This paper proposes Binomial Distribution and Poisson Distribution to be implemented in Bayesian spam filter. This approach is beneficial for calculating the probability of a mail being spam containing words that are not stored in database (i.e., encountered by the spam filter for the first time) or rare words (less frequent words). The proposed distribution for spam filters reduces and controls the false positives.
机译:贝叶斯滤波是一种反垃圾邮件算法,旨在解决处理概率的垃圾邮件。它以贝叶斯的统计定理命名,用于计算概率,无论是邮件是否为垃圾邮件。该过滤器通过将电子邮件内容(短语或令牌)与存储的数据库进行比较,有效地工作。本文提出了在贝叶斯垃圾邮件过滤器中实施的二项式分布和泊松分布。这种方法有利于计算包含在数据库中未存储在数据库中的单词的垃圾邮件的概率(即,第一次垃圾邮件过滤器遇到的邮件)或稀有单词(较少频繁的单词)。垃圾邮件过滤器的建议分发减少并控制了误报。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号