N-Gram Assisted Youtube Spam Comment Detection

Shreyas Aiyar; Nisha P Shetty

首页> 外文期刊>Procedia Computer Science >N-Gram Assisted Youtube Spam Comment Detection

【24h】

N-Gram Assisted Youtube Spam Comment Detection

机译：N-Gram辅助的YouTube垃圾邮件评论检测

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper proposes a novel methodology for the detection of intrusive comments or spam on the video-sharing website - Youtube. We describe spam comments as those which have a promotional intent or those who deem to be contextually irrelevant for a given video. The prospects of monetisation through advertising on popular social media channels over the years has attracted an increasingly larger number of users. This has in turn led to to the growth of malicious users who have begun to develop automated bots, capable of large-scale orchestrated deployment of spam messages across multiple channels simultaneously. The presence of these comments significantly hurts the reputation of a channel and also the experience of normal users. Youtube themselves have tackled this issue with very limited methods which revolve around blocking comments that contain links. Such methods have proven to be extremely ineffective as Spammers have found ways to bypass such heuristics. Standard machine learning classification algorithms have proven to be somewhat effective but there is still room for better accuracy with new approaches. In this work, we attempt to detect such comments by applying conventional machine learning algorithms such as Random Forest, Support Vector Machine, Naive Bayes along with certain custom heuristics such as N-Grams which have proven to be very effective in detecting and subsequently combating spam comments.

机译：本文提出了一种新颖的方法来检测视频共享网站Youtube上的侵入性评论或垃圾邮件。我们将垃圾评论描述为具有宣传意图或认为与特定视频在上下文上无关的评论。多年来，通过在流行的社交媒体渠道上投放广告来货币化的前景吸引了越来越多的用户。反过来，这又导致恶意用户的增长，他们开始开发自动漫游器，该漫游器能够同时跨多个渠道大规模协调部署垃圾邮件。这些评论的存在会严重损害频道的声誉以及普通用户的体验。 Youtube本身已通过非常有限的方法来解决此问题，这些方法围绕阻止包含链接的注释。由于垃圾邮件发送者发现了绕过这种启发式方法的方法，因此已证明这种方法极为无效。标准的机器学习分类算法已被证明是有效的，但是使用新方法仍存在更好的准确性。在这项工作中，我们尝试通过应用常规的机器学习算法（例如随机森林，支持向量机，朴素贝叶斯）以及某些自定义启发式方法（例如N-Grams）来检测此类评论，这些方法已被证明在检测和随后打击垃圾邮件方面非常有效评论。

著录项

来源
《Procedia Computer Science》 |2018年第22期|共9页
作者
Shreyas Aiyar; Nisha P Shetty;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
SpamYoutubeN-GramNaive-BayesRandom ForestSupport Vector MachineWord GramCharacter Gram;

机译：SpamYoutubeN-Gram朴素贝叶斯随机森林支持向量机Word GramCharacter Gram;

相似文献

外文文献
中文文献
专利

1. N-Gram Assisted Youtube Spam Comment Detection [J] . Shreyas Aiyar, Nisha P Shetty Procedia Computer Science . 2018,第1期

机译：N-Gram辅助的YouTube垃圾邮件评论检测
2. YouTube Spam Comments Detection Using Artificial Neural Network [J] . Thulfiqar Abd, Hussein Altabrawee, Samir Qaisar Ajmi Journal of Engineering & Applied Sciences . 2018,第22期

机译：YouTube垃圾评论使用人工神经网络检测
3. YouTube fails to tackle spam comments [J] . Computer active . 2013,第411期

机译：YouTube无法解决垃圾评论
4. A Mood Analysis on Youtube Comments and a Method for Improved Social Spam Detection [C] . Enaitz Ezpeleta, Mikel Iturbe, Inaki Garitano, International conference on hybrid artificial intelligent systems . 2018

机译：YouTube评论的情绪分析和改进的社交垃圾邮件检测方法
5. Social Media Comments to YouTube Videos by Cnn and Fox News Viewers on 2017 Legislative Efforts to Repeal and Replace the Affordable Care Act [D] . Mays, Genesa L. 2020

机译：2017年CNN和Fox新闻观众对YouTube视频评论的社交媒体评论是废除和取代经济实惠的护理法案的立法努力
6. Spam spam spam spam spam … [O] . Neville Goodman 2004

机译：垃圾邮件垃圾邮件垃圾邮件垃圾邮件垃圾邮件…
7. Mining User Comment Activity for Detecting Forum Spammers in YouTube [O] . Sureka, Ashish 2011

机译：挖掘用户评论活动以检测YouTube中的论坛垃圾邮件发送者

N-Gram Assisted Youtube Spam Comment Detection

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅