首页> 外文会议>IEEE International Conference on Data Science in Cyberspace >Uncovering and Characterizing Internet Water Army in Online Forums
【24h】

Uncovering and Characterizing Internet Water Army in Online Forums

机译:在在线论坛中发现和表征互联网水军

获取原文
获取外文期刊封面目录资料

摘要

Online forum provides a popular platform for users to communicate and share experiences. Comparing to the social networks which require users to register with their real identities, online forum has no such requirement which makes its users more willing to speak the truth. Unfortunately, the anonymity makes online forum more likely to be abused by malicious users. Internet water army is a typical bad phenomenon which is not only harmful to the benefits of ordinary Internet users, normal companies, but also detrimental to the social stability and the national security. Timely find out and clean up Internet water army in online forum has important significance for boosting the user experience, improving the credibility of network information and maintaining the network space security. In this paper, we propose a novel divide-and-conquer online forum Internet water army detection algorithm according to the fact that Internet water army always appear in groups, echo each other and work in collusion. The major innovations of this paper could be summarized as the following 4 points. Firstly, we propose a new measure of online forum user behavior similarity which compares the behaviors of user pairs from 3 aspects. Secondly, we put forward an interesting social network model in which edges between two users are built if they have similar behaviors. Then we prune the network by deleting the edges whose similarity is below a certain threshold and adopt a hierarchical clustering algorithm on the pruned network to find user groups who work highly cooperatively. Thirdly, we divide the whole dataset into a great many small subsets according to the discussion thread IDs and process all the subsets in parallel, reducing the time complexity greatly. We evaluate our method using the real dataset of Sina Forum and the experimental results show that our algorithm can detect Internet water army in online forums effectively and the accuracy of our algorithm is high. We conduct empirical analysis to the Internet water army we detected and find that their behavior patterns are very different from normal users'. The findings verify the correctness of our algorithm and lay the foundation for characteristic-based Internet water army detection.
机译:在线论坛为用户提供了一个交流和分享经验的流行平台。与要求用户注册其真实身份的社交网络相比,在线论坛没有这样的要求,这使其用户更愿意讲真话。不幸的是,匿名性使在线论坛更容易被恶意用户滥用。互联网水军是一种典型的不良现象,不仅危害普通互联网用户,普通企业的利益,而且不利于社会稳定和国家安全。在线论坛中及时发现和清理互联网水军,对于提升用户体验,提高网络信息的可信度和维护网络空间安全具有重要意义。针对互联网水军总是成群结队,相互呼应,共谋工作这一事实,提出了一种新颖的“分治网络”互联网水军检测算法。本文的主要创新可以归纳为以下四个方面。首先,我们提出了一种在线论坛用户行为相似性的新度量,从三个方面对用户对的行为进行了比较。其次,我们提出了一个有趣的社交网络模型,在该模型中,如果两个用户具有相似的行为,则会在他们之间建立边缘。然后,我们通过删除相似度低于某个阈值的边缘来修剪网络,并在修剪后的网络上采用分层聚类算法,以找到具有高度协作性的用户组。第三,我们根据讨论线程ID将整个数据集划分为许多小的子集,并并行处理所有子集,从而大大降低了时间复杂度。我们使用新浪论坛的真实数据集对我们的方法进行了评估,实验结果表明,该算法可以有效地检测在线论坛中的互联网水军,算法的准确性很高。我们对检测到的互联网水军进行了实证分析,发现他们的行为模式与正常用户的行为模式大不相同。这些发现证明了我们算法的正确性,为基于特征的互联网水军检测奠定了基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号