Rough sets for spam filtering: Selecting appropriate decision rules for boundary e-mail classification

Noemi Perez-Diaz; David Ruano-Ordas; Jose R. Mendez; Juan F. Galvez; Florentino Fdez-Riverola

首页> 外文期刊>Applied Soft Computing >Rough sets for spam filtering: Selecting appropriate decision rules for boundary e-mail classification

【24h】

Rough sets for spam filtering: Selecting appropriate decision rules for boundary e-mail classification

机译：垃圾邮件过滤的粗糙集：为边界电子邮件分类选择适当的决策规则

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nowadays, spam represents an extensive subset of the information delivered through Internet involving all unsolicited and disturbing communications received while using different services including e-mail, weblogs and forums. In this context, this paper reviews and brings together previous approaches and novel alternatives for applying rough set (RS) theory to the spam filtering domain by defining three different rule execution schemes: MFD (most frequent decision), LNO (largest number of objects) and LTS (largest total strength). With the goal of correctly assessing the suitability of the proposed algorithms, we specifically address and analyse significant questions for appropriate model validation like corpus selection, preprocessing and representational issues, as well as different specific benchmarking measures. From the experiments carried out using several execution schemes for selecting appropriate decision rules generated by rough sets, we conclude that the proposed algorithms can outperform other well-known anti-spam filtering techniques such as support vector machines (SVM), Adaboost and different types of Bayes classifiers.

机译：如今，垃圾邮件代表了通过Internet传递的大量信息，涉及使用各种服务（包括电子邮件，网络日志和论坛）时收到的所有未经请求的和令人不安的通信。在这种情况下，本文通过定义三种不同的规则执行方案：MFD（最频繁决策），LNO（最大对象数），回顾了将粗糙集（RS）理论应用于垃圾邮件过滤域的先前方法和新颖替代方法，和LTS（最大总强度）。为了正确评估所提出算法的适用性，我们专门针对适当的模型验证（例如语料库选择，预处理和表示问题以及不同的特定基准测试方法）解决并分析了重要问题。从使用几种执行方案来选择由粗糙集生成的适当决策规则的实验中，我们得出的结论是，所提出的算法可以胜过其他知名的反垃圾邮件过滤技术，例如支持向量机（SVM），Adaboost和不同类型的贝叶斯分类器。

著录项

来源
《Applied Soft Computing》 |2012年第11期|共12页
作者
Noemi Perez-Diaz; David Ruano-Ordas; Jose R. Mendez; Juan F. Galvez; Florentino Fdez-Riverola;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算机软件;
关键词
Spam classification; Rough sets; Rule execution schemes; Content-based techniques; Model evaluation;

机译：垃圾邮件分类;粗糙集;规则执行方案;基于内容的技术;模型评估;

相似文献

外文文献
中文文献
专利

1. Rough sets for spam filtering: Selecting appropriate decision rules for boundary e-mail classification [J] . Noemi Perez-Diaz, David Ruano-Ordas, Jose R. Mendez, Applied Soft Computing . 2012,第11期

机译：垃圾邮件过滤的粗糙集：为边界电子邮件分类选择适当的决策规则
2. Development of decision support system for product selection based on AHP, using the decision rule of rough set for qualitative evaluation [J] . Masaki Yumoto Electronics and communications in Japan . 2019,第12期

机译：利用粗糙集决策规则进行定性评价的基于层次分析法的产品选择决策支持系统的开发
3. An application of the logic of explanatory power in rough set analysis: implications for the classification of decision rules [J] . Anthony T. Odoemena International journal of data science . 2019,第2期

机译：解释性逻辑在粗糙集分析中的应用：决策规则分类的影响
4. A New Hybrid Rough Set and Soft Set Parameter Reduction Method for Spam E-Mail Classification Task [C] . Masurah Mohamad, Ali Selamat Pacific Rim knowledge acquisition workshop . 2016

机译：垃圾邮件分类任务的一种新的混合粗糙集和软集参数约简方法
5. Feature selection strategies for spam e-mail filtering. [D] . Wang, Ren. 2006

机译：垃圾邮件过滤的功能选择策略。
6. Antimicrobial peptide similarity and classification through rough set theory using physicochemical boundaries [O] . Kyle Boone, Kyle Camarda, Paulette Spencer, 2018

机译：抗菌肽相似性和分类通过理化边界的粗糙集理论
7. Rough Set Theory Approach for Filtering Spams from boundary messages in a Chat System [O] . Sanjiban Sekhar Roy, Saptarshi Charaborty, Swapnil Sourav, 2014

机译：基于聊天系统边界消息过滤垃圾邮件的粗糙集理论方法
8. Measuring uncertainty by extracting fuzzy rules using rough sets and extracting fuzzy rules under uncertainty and measuring definability using rough sets [R] . Worm, Jeffrey A., Culas, Donald E. 1991

机译：通过粗糙集提取模糊规则并在不确定条件下提取模糊规则并使用粗糙集测量可定义性来测量不确定性

Rough sets for spam filtering: Selecting appropriate decision rules for boundary e-mail classification

摘要

著录项

相似文献

相关主题

期刊订阅