Information Theory Based Feature Valuing for Logistic Regression for Spam Filtering

机译：基于信息理论对垃圾邮件过滤的逻辑回归的特点

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Discriminative learning models such as Logistic Regression (LR) has shown good performance in spam filtering tasks. While most previous researches on LR have used binary features, this discards much useful information. To overcome this problem, information theory based feature valuing method for LR instead of traditional binary features is presented. The effectiveness of our approach has been evaluated on TREC, CEAS, and SEWM test sets. Results show that the proposed method outperforms the traditional binary features in the most test sets.

机译：逻辑回归（LR）等鉴别型学习模型在垃圾邮件过滤任务中表现出良好的性能。虽然对LR的最先前的大多数研究已经使用二进制特征，但这丢弃了很多有用的信息。为了克服这个问题，提出了基于信息理论的信息理论，而不是传统二元特征的基于特征估值方法。我们的方法的有效性已在TREC，CEA和SEWM测试集上进行评估。结果表明，该方法在最多测试集中优于传统二元特征。

著录项

来源
《International Conference on Asian Language Processing》|2010年||共4页
会议地点
作者
Qi Haoliang; He Xiaoning; Han Yong; Yang Muyun; Li Sheng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP312-53;
关键词
feature valuing; informatin theory; logistic regression; spam fitering;

机译：特征估值;信息理论;逻辑回归;垃圾邮件过滤;

相似文献

外文文献
中文文献
专利

1. Spam filtering using a logistic regression model trained by an artificial bee colony algorithm [J] . Applied Soft Computing . 2020,第期

机译：使用由人工蜂殖民地算法训练的逻辑回归模型进行垃圾邮件过滤
2. Textual case-based reasoning for spam filtering: a comparison of feature-based and feature-free approaches [J] . Sarah Jane Delany, Derek Bridge Artificial Intelligence Review: An International Science and Engineering Journal . 2006,第1a2期

机译：基于文本案例的垃圾邮件过滤推理：基于特征的方法和无特征方法的比较
3. A new semantic-based feature selection method for spam filtering [J] . Mendez Jose R., Cotos-Yanez Tomas R., Ruano-Ordas David Applied Soft Computing . 2019,第期

机译：一种用于垃圾邮件过滤的新的语义特征选择方法
4. Information Theory Based Feature Valuing for Logistic Regression for Spam Filtering [C] . Qi Haoliang, He Xiaoning, Han Yong, 2010 International Conference on Asian Language Processing . 2010

机译：基于信息论的Logistic回归垃圾邮件过滤特征评估
5. Feature selection strategies for spam e-mail filtering. [D] . Wang, Ren. 2006

机译：垃圾邮件过滤的功能选择策略。
6. Reducing false positive incidental findings with ensemble genotyping and logistic regression-based variant filtering methods [O] . Kyu-Baek Hwang, In-Hee Lee, Jin-Ho Park, -1

机译：通过整体基因分型和基于逻辑回归的变异过滤方法减少假阳性偶然发现
7. Partitioned logistic regression for spam filtering [O] . Ming-wei Chang, Wen-tau Yih, Christopher Meek 2008

机译：用于垃圾邮件过滤的分区逻辑回归

Information Theory Based Feature Valuing for Logistic Regression for Spam Filtering

摘要

著录项

相似文献

相关主题

期刊订阅