首页> 美国卫生研究院文献>other >Structural Effects of Network Sampling Coverage I: Nodes Missing at Random
【2h】

Structural Effects of Network Sampling Coverage I: Nodes Missing at Random

机译:网络采样覆盖率的结构效应I:随机丢失的节点

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Network measures assume a census of a well-bounded population. This level of coverage is rarely achieved in practice, however, and we have only limited information on the robustness of network measures to incomplete coverage. This paper examines the effect of node-level missingness on 4 classes of network measures: centrality, centralization, topology and homophily across a diverse sample of 12 empirical networks. We use a Monte Carlo simulation process to generate data with known levels of missingness and compare the resulting network scores to their known starting values. As with past studies (; ), we find that measurement bias generally increases with more missing data. The exact rate and nature of this increase, however, varies systematically across network measures. For example, betweenness and Bonacich centralization are quite sensitive to missing data while closeness and in-degree are robust. Similarly, while the tau statistic and distance are difficult to capture with missing data, transitivity shows little bias even with very high levels of missingness. The results are also clearly dependent on the features of the network. Larger, more centralized networks are generally more robust to missing data, but this is especially true for centrality and centralization measures. More cohesive networks are robust to missing data when measuring topological features but not when measuring centralization. Overall, the results suggest that missing data may have quite large or quite small effects on network measurement, depending on the type of network and the question being posed.
机译:网络措施假设人口普查。但是,在实践中很少会达到这种程度的覆盖范围,而关于网络措施对不完全覆盖范围的鲁棒性,我们只有有限的信息。本文研究了节点级别缺失对4类网络度量的影响:在12个经验网络的不同样本中的中心性,集中性,拓扑和同构性。我们使用蒙特卡洛模拟过程来生成具有已知缺失水平的数据,并将所得的网络得分与其已知的起始值进行比较。与以往的研究(;)一样,我们发现,随着数据的丢失,测量偏差通常会增加。但是,这种增长的确切速度和性质在网络措施之间会系统地变化。例如,中间度和Bonacich集中度对丢失的数据非常敏感,而紧密度和入度则很健壮。同样,尽管缺少数据很难捕获tau统计量和距离,但即使缺少程度很高,传递性也几乎没有偏差。结果显然也取决于网络的功能。更大,更集中的网络通常对丢失的数据更健壮,但是对于集中性和集中化措施尤其如此。在测量拓扑特征时,更多的内聚网络对于丢失数据具有鲁棒性,而在测量集中性时则没有。总体而言,结果表明,丢失的数据可能会对网络测量产生很大或非常小的影响,具体取决于网络的类型和提出的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号