首页> 外文期刊>Decision support systems >Fame for sale: Efficient detection of fake Twitter followers
【24h】

Fame for sale: Efficient detection of fake Twitter followers

机译:待出售的名望:高效检测假Twitter关注者

获取原文
获取原文并翻译 | 示例
       

摘要

Fake followers are those Twitter accounts specifically created to inflate the number of followers of a target account. Fake followers are dangerous for the social platform and beyond, since they may alter concepts like popularity and influence in the Twittersphere hence impacting on economy, politics, and society. In this paper, we contribute along different dimensions. First, we review some of the most relevant existing features and rules (proposed by Academia and Media) for anomalous Twitter accounts detection. Second, we create a baseline dataset of verified human and fake follower accounts. Such baseline dataset is publicly available to the scientific community. Then, we exploit the baseline dataset to train a set of machine-learning classifiers built over the reviewed rules and features. Our results show that most of the rules proposed by Media provide unsatisfactory performance in revealing fake followers, while features proposed in the past by Academia for spam detection provide good results. Building on the most promising features, we revise the classifiers both in terms of reduction of overfitting and cost for gathering the data needed to compute the features. The final result is a novel Class A classifier, general enough to thwart overfitting, lightweight thanks to the usage of the less costly features, and still able to correctly classify more than 95% of the accounts of the original training set. We ultimately perform an information fusion-based sensitivity analysis, to assess the global sensitivity of each of the features employed by the classifier.
机译:伪造的追随者是那些专门用来增加目标账户的追随者数量的Twitter帐户。假冒的追随者对于社交平台及其他社交平台而言是危险的,因为它们可能会改变Twitter圈中的知名度和影响力等概念,从而影响经济,政治和社会。在本文中,我们在不同方面做出了贡献。首先,我们回顾一些最相关的现有功能和规则(由Academia和Media提出),用于异常Twitter帐户检测。其次,我们创建一个经过验证的人工和虚假追踪者帐户的基准数据集。这样的基线数据集可公开提供给科学界。然后,我们利用基线数据集来训练一组基于已检查规则和功能的机器学习分类器。我们的结果表明,Media提出的大多数规则在揭示伪造的追随者方面表现不尽人意,而Academia过去为垃圾邮件检测提出的功能却提供了良好的结果。在最有前途的功能的基础上,我们在减少过度拟合和收集计算功能所需数据的成本方面修订了分类器。最终的结果是一种新颖的A类分类器,其通用性足以抵制过度拟合,轻巧,这归功于使用了成本较低的功能,并且仍然能够正确分类原始训练集的95%以上的帐户。我们最终进行基于信息融合的敏感性分析,以评估分类器使用的每个功能的整体敏感性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号