首页> 外文会议>International Conference on Text, Speech and Dialogue >Short Messages Spam Filtering Using Sentiment Analysis
【24h】

Short Messages Spam Filtering Using Sentiment Analysis

机译:使用情感分析,短消息垃圾邮件过滤

获取原文

摘要

In the same way that short instant messages are more and more used, spam and non-legitimate campaigns through this type of communication systems are growing up. Those campaigns, besides being an illegal online activity, are a direct threat to the privacy of the users. Previous short messages spam filtering techniques focus on automatic text classification and do not take message polarity into account. Focusing on phone SMS messages, this work demonstrates that it is possible to improve spam filtering in short message services using sentiment analysis techniques. Using a publicly available labelled (spam/legitimate) SMS dataset, we calculate the polarity of each message and aggregate the polarity score to the original dataset, creating new datasets. We compare the results of the best classifiers and filters over the different datasets (with and without polarity) in order to demonstrate the influence of the polarity. Experiments show that polarity score improves the SMS spam classification, on the one hand, reaching to a 98.91% of accuracy. And on the other hand, obtaining a result of 0 false positives with 98.67% of accuracy.
机译:以与短暂即时消息越来越多的方式,通过这种类型的通信系统的垃圾邮件和非合法运动正在成长。除了非法在线活动之外,这些活动是对用户隐私的直接威胁。上一封短信垃圾邮件过滤技术侧重于自动文本分类,并不考虑消息极性。专注于电话SMS消息,这项工作表明,使用情绪分析技术可以在短消息服务中提高垃圾邮件过滤。使用公共可用标记的(垃圾邮件/合法)SMS数据集,我们计算每个消息的极性,并将极性分数汇总到原始数据集,创建新数据集。我们将最佳分类器的结果与不同的数据集(有和不具有极性)进行比较以展示极性的影响。实验表明,极性分数一方面提高了SMS垃圾邮件分类,达到了精度的98.91%。另一方面,获得0个误报的结果,精度的98.67%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号