首页> 外文期刊>International Journal of Computational Intelligence and Applications >A Semi-Supervised Learning Approach for Tackling Twitter Spam Drift
【24h】

A Semi-Supervised Learning Approach for Tackling Twitter Spam Drift

机译:一种解决Twitter垃圾邮件漂移的半监督学习方法

获取原文
获取原文并翻译 | 示例
           

摘要

Twitter has changed the way people get information by allowing them to express their opinion and comments on the daily tweets. Unfortunately, due to the high popularity of Twitter, it has become very attractive to spammers. Unlike other types of spam, Twitter spam has become a serious issue in the last few years. The large number of users and the high amount of information being shared on Twitter play an important role in accelerating the spread of spam. In order to protect the users, Twitter and the research community have been developing different spam detection systems by applying different machine-learning techniques. However, a recent study showed that the current machine learning-based detection systems are not able to detect spam accurately because spam tweet characteristics vary over time. This issue is called “Twitter Spam Drift”. In this paper, a semi-supervised learning approach (SSLA) has been proposed to tackle this. The new approach uses the unlabeled data to learn the structure of the domain. Different experiments were performed on English and Arabic datasets to test and evaluate the proposed approach and the results show that the proposed SSLA can reduce the effect of Twitter spam drift and outperform the existing techniques.
机译:Twitter通过允许他们在每日推文上表达他们的意见和评论来改变人们获取信息的方式。不幸的是,由于Twitter的高普及,它对垃圾邮件发送者变得非常有吸引力。与其他类型的垃圾邮件不同,Twitter垃圾邮件在过去几年中已成为一个严重的问题。在Twitter上共享的大量用户和高量的信息在加速垃圾邮件传播方面发挥着重要作用。为了保护用户,推特和研究界通过应用不同的机器学习技术已经开发了不同的垃圾邮件检测系统。然而,最近的一项研究表明,基于机器的基于机器的检测系统无法准确地检测垃圾邮件,因为垃圾邮件推文特性随着时间的变化而变化。此问题称为“Twitter Spam Drive”。在本文中,提出了一种半监督学习方法(SSLA)来解决这个问题。新方法使用未标记的数据来学习域的结构。对英语和阿拉伯语数据集进行不同的实验进行测试和评估所提出的方法,结果表明,所提出的SSLA可以降低Twitter垃圾邮件漂移的影响,优于现有技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号