首页> 外文会议>Conference on cyber sensing >Challenges to inferring causality from viral information dispersion in dynamic social networks
【24h】

Challenges to inferring causality from viral information dispersion in dynamic social networks

机译:从动态社交网络中的病毒性信息传播推断因果关系的挑战

获取原文

摘要

Understanding the mechanism behind large-scale information dispersion through complex networks has important implications for a variety of industries ranging from cyber-security to public health. With the unprecedented availability of public data from online social networks (OSNs) and the low cost nature of most OSN outreach, randomized controlled experiments, the "gold standard" of causal inference methodologies, have been used with increasing regularity to study viral information dispersion. And while these studies have dramatically furthered our understanding of how information disseminates through social networks by isolating causal mechanisms, there are still major methodological concerns that need to be addressed in future research. This paper delineates why modern OSNs are markedly different from traditional sociological social networks and why these differences present unique challenges to experimentalists and data scientists. The dynamic nature of OSNs is particularly troublesome for researchers implementing experimental designs, so this paper identifies major sources of bias arising from network mutability and suggests strategies to circumvent and adjust for these biases. This paper also discusses the practical considerations of data quality and collection, which may adversely impact the efficiency of the estimator. The major experimental methodologies used in the current literature on virality are assessed at length, and their strengths and limits identified. Other, as-yet-unsolved threats to the efficiency and unbiasedness of causal estimators-such as missing data-are also discussed. This paper integrates methodologies and learnings from a variety of fields under an experimental and data science framework in order to systematically consolidate and identify current methodological limitations of randomized controlled experiments conducted in OSNs.
机译:了解通过复杂网络进行大规模信息传播的机制,对从网络安全到公共卫生的各种行业都具有重要意义。随着来自在线社交网络(OSN)的公共数据的空前可用性以及大多数OSN推广活动的低成本性质,随机对照实验,因果推理方法的“黄金标准”已越来越有规律地用于研究病毒信息的扩散。尽管这些研究通过隔离因果机制极大地增进了我们对信息如何通过社交网络传播的理解,但在未来的研究中仍然需要解决主要的方法论问题。本文描述了为什么现代OSN与传统的社会学社交网络显着不同,以及为什么这些差异给实验家和数据科学家带来了独特的挑战。 OSN的动态性质对于实施实验设计的研究人员特别麻烦,因此,本文确定了网络可变性引起的主要偏差来源,并提出了规避和调整这些偏差的策略。本文还讨论了数据质量和收集的实际考虑,这可能会对估计器的效率产生不利影响。对当前有关病毒学的文献中使用的主要实验方法进行了详细评估,并确定了它们的优缺点。还讨论了因果估计量的效率和公正性尚未解决的其他威胁,例如数据丢失。本文在实验和数据科学框架下整合了来自各个领域的方法论和学习,以便系统地整合和确定OSN中进行的随机对照实验的当前方法学局限性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号