首页> 外文期刊>Journal of the Royal Statistical Society >Correlates of record linkage and estimating risks of non-linkage biases in business data sets
【24h】

Correlates of record linkage and estimating risks of non-linkage biases in business data sets

机译:记录链接的相关性和估计业务数据集中非链接偏差的风险

获取原文
获取原文并翻译 | 示例
       

摘要

Researchers often utilize data sets that link information from multiple sources, but non-linkage biases caused by linked and non-linked subject differences are little understood, especially in business data sets. We address these knowledge gaps by studying biases in linkable 2010 UK Small Business Survey data sets. We identify correlates of business linkage propensity, and also for the first time its components: consent to linkage and register identifier appendability. As well, we take a novel approach to evaluating non-linkage bias risks, by computing data set representativeness indicators (comparable, decomposable sample subset similarity measures). We find that the main impacts on linkage propensities and bias risks are due to consenter-non-consenter differences explicable given business survey response processes, and differences between subjects with and without identifiers caused by register undercoverage of very small businesses. We then discuss consequences for the analysis of linked business data sets, and implications of the evaluation methods we introduce for linked data set producers and users.
机译:研究人员经常利用链接来自多个来源的信息的数据集,但是,由链接的和非链接的主题差异引起的非链接偏差很少被理解,尤其是在业务数据集中。我们通过研究可链接的2010年英国小型企业调查数据集中的偏见来解决这些知识差距。我们确定了业务链接倾向的相关性,并且首次确定了其组成部分:对链接的同意和注册标识符的可附加性。同样,我们通过计算数据集的代表性指标(可比较的,可分解的样本子集相似性度量),采用一种新颖的方法来评估非链接偏倚风险。我们发现,对链接倾向和偏见风险的主要影响是由于在给定业务调查响应流程的情况下,同意者/不同意者之间的差异是显而易见的,以及由于很小的企业的注册不足而导致具有标识符和没有标识符的主题之间的差异。然后,我们讨论对链接的业务数据集进行分析的后果,以及我们介绍的对链接的数据集生产者和用户的评估方法的含义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号