首页> 外文会议>International Conference on Cloud Computing >Exploiting the Spam Correlations in Scalable Online Social Spam Detection
【24h】

Exploiting the Spam Correlations in Scalable Online Social Spam Detection

机译:利用可扩展在线社交垃圾邮件检测中的垃圾邮件相关性

获取原文

摘要

The huge amount of social spam from large-scale social networks has been a common phenomenon in the contemporary world. The majority of former research focused on improving the efficiency of identifying social spam from a limited size of data in the algorithm side, however, few of them target on the data correlations among large-scale distributed social spam and utilize the benefits from the system side. In this paper, we propose a new scalable system, named SpamHunter, which can utilize the spam correlations from distributed data sources to enhance the performance of large-scale social spam detection. It identifies the correlated social spam from various distributed servers/sources through DHT-based hierarchical functional trees. These functional trees act as bridges among data servers/sources to aggregate, exchange, and communicate the updated and newly emerging social spam with each other. Furthermore, by processing the online social logs instantly, it allows online streaming data to be processed in a distributed manner, which reduces the online detection latency and avoids the inefficiency of outdated spam posts. Our experimental results with real-world social logs demonstrate that SpamHunter reaches 95% F1 score in the spam detection, achieves high efficiency in scaling to a large amount of data servers with low latency.
机译:来自大型社交网络的大量社交垃圾邮件在当代世界中一直是一种常见的现象。以前研究的大多数研究侧重于提高算法侧的有限数据识别社交垃圾邮件的效率,然而,其中很少有人在大规模分布式社会垃圾邮件之间的数据相关性上,并利用系统侧的益处。在本文中,我们提出了一个名为SpamHunter的新的可扩展系统,它可以利用来自分布式数据源的垃圾邮件相关性来增强大规模社会垃圾邮件检测的性能。它通过基于DHT的分层功能树来识别来自各种分布式服务器/源的相关社交垃圾邮件。这些功能树作为数据服务器/源之间的桥梁,以聚合,交换,并互相传达更新和新兴的社交垃圾邮件。此外,通过立即处理在线社交日志,它允许以分布式方式处理在线流数据,这减少了在线检测延迟并避免了过时的垃圾邮件帖子的低效率。我们的实验结果与现实世界的社交日志表明,垃圾邮件在垃圾邮件检测中达到95%F1得分,在缩放到具有低延迟的大量数据服务器方面取得了高效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号