首页> 外文期刊>Journal of supercomputing >An enhanced graph-based semi-supervised learning algorithm to detect fake users on Twitter
【24h】

An enhanced graph-based semi-supervised learning algorithm to detect fake users on Twitter

机译:一种基于图的增强型半监督学习算法,可检测Twitter上的虚假用户

获取原文
获取原文并翻译 | 示例
           

摘要

Over the the past decade, social networking services (SNS) have proliferated on the web. The nature of such sites makes identity deception easy, providing a fast means for creating and managing identities, and then connecting with and deceiving others. Fake users are those accounts specifically created for purposes such as stalking or abuse of another user, for slander, or for marketing. The current system for detecting deception depends on behavioral, non-behavioral and user-generated content (UGC) information gathered from users. Although these methods have high detection accuracy, they cannot be implemented in databases with massive volumes of data. To address this issue, this paper proposes an enhanced graph-based semi-supervised learning algorithm (EGSLA) to detect fake users from a large volume of Twitter data. The proposed method encompasses four modules: data collection, feature extraction, classification and decision making. Data collected from Twitter using Scrapy is utilized for the evaluation. The performance of the proposed algorithm is tested with existing game theory, k-nearest neighbor (KNN), support vector machine (SVM) and decision tree techniques. The results show that the proposed EGSLA algorithm achieves 90.3% accuracy in spotting fake users.
机译:在过去的十年中,社交网络服务(SNS)在网络上激增。这些站点的性质使身份欺骗变得容易,提供了一种快速的方法来创建和管理身份,然后与他人联系并欺骗他人。假用户是专门为跟踪或滥用其他用户,诽谤或营销目的创建的帐户。用于检测欺骗的当前系统取决于从用户收集的行为,非行为和用户生成的内容(UGC)信息。尽管这些方法具有很高的检测精度,但无法在具有大量数据的数据库中实现。为了解决这个问题,本文提出了一种基于图的增强型半监督学习算法(EGSLA),可以从大量Twitter数据中检测假用户。该方法包括四个模块:数据收集,特征提取,分类和决策。使用Scrapy从Twitter收集的数据用于评估。利用现有的博弈论,k近邻(KNN),支持向量机(SVM)和决策树技术对所提算法的性能进行了测试。结果表明,提出的EGSLA算法在识别假用户方面达到了90.3%的准确率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号