首页> 外文会议>International Conference on Advanced Computer Science and Information Systems >Multiclass SMS message categorization: Beyond spam binary classification
【24h】

Multiclass SMS message categorization: Beyond spam binary classification

机译:多类SMS邮件分类:超越垃圾邮件二进制分类

获取原文

摘要

SMS spam has been growing since mobile phone usage increases. Past researches on SMS spam detection only classified SMS into two categories, spam and not spam. The binary classification of SMS spam prevents the user from seeing the spam messages that they do not really hate, e.g. an advertisement from their favorite product In this paper, we propose multi-class classification of SMS into: regular, info, ads, and fraud. We use content-based (top-N unigram) as well as non-content based features. The result shows that the best accuracy is achieved by logistic regression that is 97.5 % accuracy with configuration of normalization preprocess and 4096 top-N unigram features.
机译:自从手机使用量增加以来,SMS垃圾邮件一直在增长。过去有关SMS垃圾邮件检测的研究仅将SMS分为垃圾邮件和非垃圾邮件两类。 SMS垃圾邮件的二进制分类可防止用户看到他们并不真正讨厌的垃圾邮件,例如,来自他们最喜欢的产品的广告在本文中,我们提出了SMS的多类分类:常规,信息,广告和欺诈。我们使用基于内容的(前N个字母组合)以及不基于内容的功能。结果表明,通过归一化预处理和4096个top-N字母组合特征的配置的logistic回归可实现97.5 \%的准确性,从而获得了最佳的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号