【24h】

Towards filtering spam mails using dimensionality reduction methods

机译:使用降维方法过滤垃圾邮件

获取原文
获取原文并翻译 | 示例

摘要

Numerous methods based on the content based filtering is available for email spam identification. Dimensionality of the feature space is recognized as one of the leading factors that affect the efficiency in classifying mails. This study identifies feature selection techniques used in the general text classification for spam filtering. Also, the classification and prediction is performed using different entities of email such as header, body and subject. We present a comparative study of different feature selection methods. Through extensive experiments we demonstrated that Weighted Mutual Information feature selection with header and body of the emails is efficient in email classification.
机译:许多基于内容过滤的方法可用于电子邮件垃圾邮件识别。特征空间的维数被认为是影响邮件分类效率的主要因素之一。这项研究确定了用于垃圾邮件过滤的常规文本分类中使用的特征选择技术。同样,使用电子邮件的不同实体(例如标题,正文和主题)执行分类和预测。我们对不同的特征选择方法进行了比较研究。通过广泛的实验,我们证明了带有电子邮件标题和正文的加权互信息特征选择在电子邮件分类中是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号