首页> 外文会议>International conference on text, speech and dialogue >Short Messages Spam Filtering Using Sentiment Analysis
【24h】

Short Messages Spam Filtering Using Sentiment Analysis

机译:使用情感分析的短信垃圾邮件过滤

获取原文

摘要

In the same way that short instant messages are more and more used, spam and non-legitimate campaigns through this type of communication systems are growing up. Those campaigns, besides being an illegal online activity, are a direct threat to the privacy of the users. Previous short messages spam filtering techniques focus on automatic text classification and do not take message polarity into account. Focusing on phone SMS messages, this work demonstrates that it is possible to improve spam filtering in short message services using sentiment analysis techniques. Using a publicly available labelled (spam/legitimate) SMS dataset, we calculate the polarity of each message and aggregate the polarity score to the original dataset, creating new datasets. We compare the results of the best classifiers and filters over the different datasets (with and without polarity) in order to demonstrate the influence of the polarity. Experiments show that polarity score improves the SMS spam classification, on the one hand, reaching to a 98.91 % of accuracy. And on the other hand, obtaining a result of 0 false positives with 98.67% of accuracy.
机译:就像越来越多地使用短即时消息一样,通过这种类型的通信系统进行的垃圾邮件和不正当的活动也在不断增长。这些活动除了是非法的在线活动之外,还直接威胁到用户的隐私。以前的短消息垃圾邮件过滤技术着重于自动文本分类,并且没有考虑消息极性。着眼于电话SMS消息,这项工作表明可以使用情感分析技术来改善短消息服务中的垃圾邮件筛选。使用公开标记的(垃圾邮件/合法)SMS数据集,我们计算每封邮件的极性并将极性分数汇总到原始数据集,从而创建新的数据集。我们比较了最佳分类器和过滤器在不同数据集(有和没有极性)上的结果,以证明极性的影响。实验表明,极性评分可以改善SMS垃圾邮件的分类,一方面,可以达到98.91%的准确性。另一方面,获得0个误报的结果,准确度为98.67%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号