首页> 外文会议>Asian Language Processing, 2009. IALP '09 >The Improved Logistic Regression Models for Spam Filtering

【24h】

The Improved Logistic Regression Models for Spam Filtering

机译：用于垃圾邮件过滤的改进Logistic回归模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The logistic regression model has achieved success in spam filtering. But it is disadvantaged by the equal adjustment of the feature weights appeared in both spam messages and ham ones during training period. This paper presents an improved logistic regression model which reduces the impact of the features appearing in both spam messages and ham ones. Byte level n-grams are employed to extract the features from messages, and TONE (Train On or Near Error) is adopted, which are proved effective in state-of-the-art spam filtering system. The official runs of CEAS (Conference on Email and Anti-Spam) Spam-filter Challenge 2008 show that the proposed model is one of the best methods. Our system achieved competitive results in all tasks and is the winner of active learning on the live stream by 1- ROCA.

机译：逻辑回归模型在垃圾邮件过滤方面取得了成功。但这是不利的，因为在培训期间，垃圾邮件和火腿邮件中特征权重的均等调整是不利的。本文提出了一种改进的逻辑回归模型，该模型可减少垃圾邮件和垃圾邮件中出现的功能的影响。字节级n-gram用于从邮件中提取特征，并采用TONE（Train On或Near Error），在最新的垃圾邮件过滤系统中被证明是有效的。 CEAS（电子邮件和反垃圾邮件会议）垃圾邮件过滤器挑战赛2008的官方运行表明，该模型是最好的方法之一。我们的系统在所有任务上均取得了竞争性成绩，并且是1- ROCA实时直播学习的赢家。

著录项

来源
《Asian Language Processing, 2009. IALP '09 》|2009年|314-317|共4页
会议地点 Singapore(SG);Singapore(SG)
作者
Han Yong; Yang Muyun; Qi Haoliang; He Xiaoning; Li Sheng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
byte level n-gram; improved logistic regression; online learning; spam filtering;

机译：字节级n-gram；改进的逻辑回归；在线学习；垃圾邮件过滤;

相似文献

外文文献
中文文献
专利

1. Spam filtering using a logistic regression model trained by an artificial bee colony algorithm [J] . Applied Soft Computing . 2020 ,第期

机译：使用由人工蜂殖民地算法训练的逻辑回归模型进行垃圾邮件过滤
2. An Email Modelling Approach for Neural Network Spam Filtering to Improve Score-based Anti-spam Systems [J] . Yahya Alamlahi, Abdulrahman Muthana International Journal of Computer Network and Information Security . 2018 ,第12期

机译：用于神经网络垃圾邮件过滤的电子邮件建模方法，以改进基于分数的反垃圾邮件系统
3. Logistic Model Tree Induction Machine Learning Technique for Email Spam Filtering [J] . Emmanuel Gbenga Dada, Joseph Stephen Bassi Pacific Journal of Science and Technology . 2018 ,第2期

机译：用于电子邮件垃圾邮件过滤的Logistic模型树归纳机器学习技术
4. The Improved Logistic Regression Models for Spam Filtering [C] . Yong Han, Muyun Yang, Hapliang Qi, International Conference on Asian Language Processing . 2009

机译：用于垃圾邮件过滤的改进的逻辑回归模型
5. A study of the behaviors of test statistics in the binary logistic regression model, the proportional odds ordinal logistic regression model, and the proportional hazards model [D] . McBride, Mark Leon 2000

机译：二元逻辑回归模型，比例赔率序数逻辑回归模型和比例风险模型中检验统计量行为的研究
6. Reducing false positive incidental findings with ensemble genotyping and logistic regression-based variant filtering methods [O] . Kyu-Baek Hwang, In-Hee Lee, Jin-Ho Park, -1

机译：通过整体基因分型和基于逻辑回归的变异过滤方法减少假阳性偶然发现
7. Partitioned logistic regression for spam filtering [O] . Ming-wei Chang, Wen-tau Yih, Christopher Meek 2008

机译：用于垃圾邮件过滤的分区逻辑回归

The Improved Logistic Regression Models for Spam Filtering

摘要

著录项

相似文献

相关主题

期刊订阅