Published in: International Conference on Digital Information Management

A data mining based spam detection system for YouTube

Abstract

People now feel more comfortable socializing over the internet, through popular social networking and media websites, than face to face. As a result, social media websites are thriving. YouTube is one such site: it is vastly popular, is expanding at a very fast pace, and depends mostly on user-created content and on that content being shared and spread. Business entities and public figures take advantage of this popularity by creating their own pages and sharing information with the large number of visitors. However, this popularity has also made YouTube more susceptible to different types of unwanted and malicious spammers. Currently, YouTube does not have any way to handle its video spammers; it only considers mass comments or messages to be spam. To increase the popularity of a video, malicious users post video response spam, in which the video content is unrelated to the topic being discussed in the particular video or does not contain the media it is supposed to. In this research, we explore different attributes that could help identify video spammers. We first collect data on YouTube videos and manually classify each video as either legitimate or spam. We then devise a number of video attributes that could potentially be used to detect spam. We apply Microsoft SQL Server Data Mining Tools (SSDT) to provide a heuristic for classifying an arbitrary video as either spam or legitimate. Our results demonstrate that, in the long run, we can successfully classify videos as spam or legitimate in most cases.
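The workflow the abstract describes (label videos by hand, derive attributes, learn a classification heuristic) can be illustrated with a minimal sketch. This is not the authors' method: the paper uses Microsoft SQL Server data mining tools, whereas the Python below substitutes a single-threshold rule (a decision stump) purely for illustration. The attributes (`views`, `comments`, `duration_s`), the derived feature `comments_per_view`, and all sample values are hypothetical, since the abstract does not list the attributes or data actually used.

```python
from dataclasses import dataclass


@dataclass
class Video:
    # Hypothetical attributes; the abstract does not name the real ones.
    views: int
    comments: int
    duration_s: int
    is_spam: bool = False  # manual label; ignored at prediction time


def features(v):
    """Derive numeric attributes from a video (illustrative choices only)."""
    return {
        "comments_per_view": v.comments / max(v.views, 1),
        "duration_s": float(v.duration_s),
    }


def train_stump(videos):
    """Return the (feature, threshold, spam_if_below) rule with the fewest
    errors on the manually labeled training videos."""
    rows = [(features(v), v.is_spam) for v in videos]
    best = None
    for name in rows[0][0]:
        for t in sorted({f[name] for f, _ in rows}):
            for spam_if_below in (True, False):
                errors = sum(
                    ((f[name] <= t) == spam_if_below) != label
                    for f, label in rows
                )
                if best is None or errors < best[0]:
                    best = (errors, name, t, spam_if_below)
    _, name, t, spam_if_below = best
    return name, t, spam_if_below


def predict(stump, v):
    """Classify a video as spam (True) or legitimate (False)."""
    name, t, spam_if_below = stump
    return (features(v)[name] <= t) == spam_if_below


# Made-up, manually "labeled" examples for demonstration only.
videos = [
    Video(1000, 50, 240, False),
    Video(5000, 300, 600, False),
    Video(800, 40, 180, False),
    Video(20000, 3, 15, True),
    Video(15000, 1, 10, True),
    Video(30000, 5, 20, True),
]

stump = train_stump(videos)
print(stump[0], predict(stump, Video(25000, 2, 12)))
```

A real system would evaluate such a rule on held-out videos rather than on the training set, and would likely combine several attributes instead of relying on one threshold.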