【24h】

Improving spam detection in Online Social Networks

机译:在线社交网络中提高垃圾邮件检测

获取原文

摘要

Online Social Networks (OSNs) are deemed to be the most sought-after societal tool used by the masses world over to communicate and transmit information. Our dependence on these platforms for seeking opinions, news, updates, etc. is increasing. While it is true that OSNs have become a new medium for dissemination of information, at the same time, they are also fast becoming a playground for the spread of misinformation, propaganda, fake news, rumors, unsolicited messages, etc. Consequently, we can say that an OSN platform comprises of two kinds of users namely, Spammers and Non-Spammers. Spammers, out of malicious intent, post either unwanted (or irrelevant) information or spread misinformation on OSN platforms. As part of our work, we propose mechanisms to detect such users (Spammers) in Twitter social network (a popular OSN). Our work is based on a number of features at tweet-level and user-level like Followers/Followees, URLs, Spam Words, Replies and HashTags. In our work, we have applied three learning algorithms namely Naive Bayes, Clustering and Decision trees. Furthermore, to improve detection of Spammers, a novel integrated approach is proposed which “combines” the advantages of the three learning algorithms mentioned above. Improvement of spam detection is measured on the basis of Total Accuracy, Spammers Detection Accuracy and Non-Spammers Detection Accuracy. Results, thus obtained, show that our novel integrated approach that combines all algorithms outperforms other classical approaches in terms of overall accuracy and detect Non-Spammers with 99% accuracy with an overall accuracy of 87.9%.
机译:在线社交网络(OSNS)被视为群众世界过度达到沟通和传输信息的最追捧的社会工具。我们对寻求意见,新闻,更新等的这些平台的依赖性正在增加。虽然osns已成为传播信息的新媒介,同时,它们也很快成为误导,宣传,假新闻,谣言,未经请求的消息等的游乐场。因此,我们可以说OSN平台包括两种用户,即垃圾邮件发送者和非垃圾邮件发送者。垃圾邮件发送者,出于恶意的意图,发布不需要的(或无关)信息或在OSN平台上传播错误信息。作为我们工作的一部分,我们提出了检测Twitter社交网络(流行OSN)中这些用户(垃圾邮件发送者)的机制。我们的工作基于Tweet-Level和用户级别的许多功能,如追随者/追随者,URL,垃圾邮件,回复和HASHTAG。在我们的工作中,我们已经应用了三种学习算法,即天鹅,聚类和决策树。此外,为了改善垃圾邮件发送者的检测,提出了一种新的综合方法,其“结合”上述三种学习算法的优点。垃圾邮件检测的改进是基于总精度,垃圾邮件发送的垃圾邮件检测精度和非垃圾邮件发送者检测精度测量。从而获得的结果表明,我们的新综合方法将所有算法结合在整体准确性方面优于其他经典方法,并检测了99%精度的非垃圾邮件仪,总精度为87.9%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号