【24h】

An Assessment of Case-Based Reasoning for Spam Filtering

机译:基于案例的垃圾邮件过滤推理评估

获取原文
获取原文并翻译 | 示例
           

摘要

Because of the changing nature of spam, a spam filtering system that uses machine learning will need to be dynamic. This suggests that a case-based (memory-based) approach may work well. Case-Based Reasoning (CBR) is a lazy approach to machine learning where induction is delayed to run time. This means that the case base can be updated continuously and new training data is immediately available to the induction process. In this paper we present a detailed description of such a system called ECUE and evaluate design decisions concerning the case representation. We compare its performance with an alternative system that uses Naive Bayes. We find that there is little to choose between the two alternatives in cross-validation tests on data sets. However, ECUE does appear to have some advantages in tracking concept drift over time.
机译:由于垃圾邮件的性质不断变化,使用机器学习的垃圾邮件过滤系统将需要动态。这表明基于案例(基于内存)的方法可能效果很好。基于案例的推理(CBR)是一种懒惰的机器学习方法,其中归纳延迟了运行时间。这意味着案例库可以连续更新,并且新的培训数据可立即用于上岗过程。在本文中,我们对称为ECUE的系统进行了详细描述,并评估了有关案例表示的设计决策。我们将其性能与使用Naive Bayes的替代系统进行比较。我们发现在数据集的交叉验证测试中,在这两种选择之间几乎没有选择。但是,ECUE在跟踪随时间推移的概念漂移方面确实具有一些优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号