首页> 外文会议>IEEE International Conference on Data Science in Cyberspace >Uncovering and Characterizing Internet Water Army in Online Forums
【24h】

Uncovering and Characterizing Internet Water Army in Online Forums

机译:在网上论坛中揭开和特征互联网水陆

获取原文

摘要

Online forum provides a popular platform for users to communicate and share experiences. Comparing to the social networks which require users to register with their real identities, online forum has no such requirement which makes its users more willing to speak the truth. Unfortunately, the anonymity makes online forum more likely to be abused by malicious users. Internet water army is a typical bad phenomenon which is not only harmful to the benefits of ordinary Internet users, normal companies, but also detrimental to the social stability and the national security. Timely find out and clean up Internet water army in online forum has important significance for boosting the user experience, improving the credibility of network information and maintaining the network space security. In this paper, we propose a novel divide-and-conquer online forum Internet water army detection algorithm according to the fact that Internet water army always appear in groups, echo each other and work in collusion. The major innovations of this paper could be summarized as the following 4 points. Firstly, we propose a new measure of online forum user behavior similarity which compares the behaviors of user pairs from 3 aspects. Secondly, we put forward an interesting social network model in which edges between two users are built if they have similar behaviors. Then we prune the network by deleting the edges whose similarity is below a certain threshold and adopt a hierarchical clustering algorithm on the pruned network to find user groups who work highly cooperatively. Thirdly, we divide the whole dataset into a great many small subsets according to the discussion thread IDs and process all the subsets in parallel, reducing the time complexity greatly. We evaluate our method using the real dataset of Sina Forum and the experimental results show that our algorithm can detect Internet water army in online forums effectively and the accuracy of our algorithm is high. We conduct empirical analysis to the Internet water army we detected and find that their behavior patterns are very different from normal users'. The findings verify the correctness of our algorithm and lay the foundation for characteristic-based Internet water army detection.
机译:在线论坛为用户提供了一个流行的平台,用于沟通和分享体验。与我们要求用户注册其真实身份的社交网络相比,在线论坛没有这样的要求,使其用户更愿意说实话。不幸的是,匿名使在线论坛更有可能被恶意用户滥用。互联网水力是一种典型的糟糕现象,不仅对普通互联网用户,普通公司的好处有害,而且对社会稳定和国家安全的利益有害。在线论坛及时清理互联网水资源,对提高用户体验,提高网络信息的可信度并保持网络空间安全性具有重要意义。在本文中,我们提出了一种小说族和征服在线论坛互联网水力检测算法,即互联网水力总是出现在群体中,互相回应并在勾结中工作。本文的主要创新可以归纳为以下4分。首先,我们提出了一种新的在线论坛用户行为相似性,这比比较了3个方面的用户对的行为。其次,我们提出了一个有趣的社交网络模型,其中如果它们具有相似的行为,建立了两个用户之间的边缘。然后我们通过删除相似性低于某个阈值的边缘来修剪网络,并在修剪网络上采用分层聚类算法来查找高度合作工作的用户组。第三,我们根据讨论线程IDS将整个数据集分为大量的小型子集并并行处理所有子集,大大减少了时间复杂度。我们使用新浪论坛的真实数据集进行评估我们的方法,实验结果表明,我们的算法有效地检测在线论坛中的互联网水力,算法的准确性很高。我们对我们检测到的互联网水道进行实证分析,并发现其行为模式与普通用户的行为模式非常不同。调查结果验证了我们算法的正确性,并为基于特征的互联网水力检测奠定了基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号