...
首页> 外文期刊>Applied Soft Computing >Rough sets for spam filtering: Selecting appropriate decision rules for boundary e-mail classification
【24h】

Rough sets for spam filtering: Selecting appropriate decision rules for boundary e-mail classification

机译:垃圾邮件过滤的粗糙集:为边界电子邮件分类选择适当的决策规则

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Nowadays, spam represents an extensive subset of the information delivered through Internet involving all unsolicited and disturbing communications received while using different services including e-mail, weblogs and forums. In this context, this paper reviews and brings together previous approaches and novel alternatives for applying rough set (RS) theory to the spam filtering domain by defining three different rule execution schemes: MFD (most frequent decision), LNO (largest number of objects) and LTS (largest total strength). With the goal of correctly assessing the suitability of the proposed algorithms, we specifically address and analyse significant questions for appropriate model validation like corpus selection, preprocessing and representational issues, as well as different specific benchmarking measures. From the experiments carried out using several execution schemes for selecting appropriate decision rules generated by rough sets, we conclude that the proposed algorithms can outperform other well-known anti-spam filtering techniques such as support vector machines (SVM), Adaboost and different types of Bayes classifiers.
机译:如今,垃圾邮件代表了通过Internet传递的大量信息,涉及使用各种服务(包括电子邮件,网络日志和论坛)时收到的所有未经请求的和令人不安的通信。在这种情况下,本文通过定义三种不同的规则执行方案:MFD(最频繁决策),LNO(最大对象数),回顾了将粗糙集(RS)理论应用于垃圾邮件过滤域的先前方法和新颖替代方法,和LTS(最大总强度)。为了正确评估所提出算法的适用性,我们专门针对适当的模型验证(例如语料库选择,预处理和表示问题以及不同的特定基准测试方法)解决并分析了重要问题。从使用几种执行方案来选择由粗糙集生成的适当决策规则的实验中,我们得出的结论是,所提出的算法可以胜过其他知名的反垃圾邮件过滤技术,例如支持向量机(SVM),Adaboost和不同类型的贝叶斯分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号