首页> 外文会议>International conference on cloud computing;World Congress on Services >Exploiting the Spam Correlations in Scalable Online Social Spam Detection
【24h】

Exploiting the Spam Correlations in Scalable Online Social Spam Detection

机译:在可扩展的在线社交垃圾邮件检测中利用垃圾邮件相关性

获取原文

摘要

The huge amount of social spam from large-scale social networks has been a common phenomenon in the contemporary world. The majority of former research focused on improving the efficiency of identifying social spam from a limited size of data in the algorithm side, however, few of them target on the data correlations among large-scale distributed social spam and utilize the benefits from the system side. In this paper, we propose a new scalable system, named SpamHunter, which can utilize the spam correlations from distributed data sources to enhance the performance of large-scale social spam detection. It identifies the correlated social spam from various distributed servers/sources through DHT-based hierarchical functional trees. These functional trees act as bridges among data servers/sources to aggregate, exchange, and communicate the updated and newly emerging social spam with each other. Furthermore, by processing the online social logs instantly, it allows online streaming data to be processed in a distributed manner, which reduces the online detection latency and avoids the inefficiency of outdated spam posts. Our experimental results with real-world social logs demonstrate that SpamHunter reaches 95% F1 score in the spam detection, achieves high efficiency in scaling to a large amount of data servers with low latency.
机译:来自大规模社交网络的大量社交垃圾邮件已成为当今世界的普遍现象。以往的研究大多集中在提高从算法侧有限的数据识别社交垃圾邮件的效率方面,然而,很少有研究针对大规模分布式社交垃圾邮件之间的数据相关性并利用系统侧的好处。在本文中,我们提出了一个名为SpamHunter的新可扩展系统,该系统可以利用来自分布式数据源的垃圾邮件相关性来增强大规模社交垃圾邮件检测的性能。它通过基于DHT的分层功能树从各种分布式服务器/源中识别相关的社交垃圾邮件。这些功能树充当数据服务器/源之间的桥梁,以相互聚合,交换和通信更新的和新兴的社交垃圾邮件。此外,通过立即处理在线社交日志,它允许以分布式方式处理在线流数据,从而减少了在线检测延迟并避免了垃圾邮件过时的效率低下。我们对真实社会日志的实验结果表明,SpamHunter在垃圾邮件检测中达到了95%的F1分数,可以高效地扩展到具有低延迟的大量数据服务器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号