首页> 外文期刊>IASSIST Quarterly >Mathematics, risk, and messy survey data
【24h】

Mathematics, risk, and messy survey data

机译:数学,风险和杂乱调查数据

获取原文
           

摘要

Research funder mandates, such as those from the U.S. National Science Foundation (2011), the Canadian Tri-Agency (draft, 2018), and the UK Economic and Social Research Council (2018) now often include requirements for data curation, including where possible data sharing in an approved archive. Data curators need to be prepared for the potential that researchers who have not previously shared data will need assistance with cleaning and depositing datasets so that they can meet these requirements and maintain funding. Data de-identification or anonymization is a major ethical concern in cases where survey data is to be shared, and one which data professionals may find themselves ill-equipped to deal with. This article is intended to provide an accessible and practical introduction to the theory and concepts behind data anonymization and risk assessment, will describe a couple of case studies that demonstrate how these methods were carried out on actual datasets requiring anonymization, and discuss some of the difficulties encountered. Much of the literature dealing with statistical risk assessment of anonymized data is abstract and aimed at computer scientists and mathematicians, while material aimed at practitioners often does not consider more recent developments in the theory of data anonymization. We hope that this article will help bridge this gap.
机译:研究资助者的任务,如来自美国国家科学基金会(2011年),加拿大三局(草案,2018),以及英国经济与社会研究委员会(2018)现在经常包括用于数据策展的要求,包括在可能的情况在批准的归档数据共享。数据策展人需要的潜力,做好准备,谁以前没有共享的数据将需要清洗和存放数据集,使他们能够满足这些要求,并保持资金援助的研究人员。数据去标识或匿名在案件的主要道德问题,其中的调查数据是被共享,以及一个数据的专业人士可能会发现自己没有能力处理。本文旨在提供一个方便和实用的介绍的理论和数据匿名化和风险评估背后的概念,将介绍几个案例研究,展示这些方法是如何在要求匿名的实际数据集进行,并讨论一些困难遭遇。许多文献与交易的匿名数据的统计风险评估是抽象的,针对计算机科学家和数学家,而材料旨在从业人员往往不考虑在数据匿名化的理论最近的事态发展。我们希望这篇文章能够帮助缩小这一差距。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号