首页> 外文期刊>Statistical Journal of the IAOS: Journal of the International Association for Official Statistics >Discussion of the synthetic data papers published in the previous issue
【24h】

Discussion of the synthetic data papers published in the previous issue

机译:讨论上一期发表的综合数据文件

获取原文
获取原文并翻译 | 示例
           

摘要

In our data driven society in which we expect that all major decisions are backed up by empirical evidence based on high quality data, broad access to these data is a must. However, the benefits of broad data access need to be balanced against potential risks of disclosure. Most data gathered by government agencies are collected under the pledge of confidentiality and the agencies have a legal and moral obligation to guarantee this pledge. Furthermore, if respondents get the impression that their data are not sufficiently protected they might refuse to participate or purposely provide wrong answers jeopardizing the quality of the collected data. Statistical agencies thus have to address this trade-off and much progress has been made in the last decades increasing the amount of data available for the general public while maintaining the confidentiality of the survey respondents. Still, there are certain types of data for which addressing this trade-off is particularly difficult. Medical records containing sensitive information on health status are one example, another example are business data. These data are particularly difficult to protect since a few variables usually suffice to identify larger businesses in the data. At the same time the collected information is often sensitive since other establishments might gain an edge if they learn certain attributes about their competitors. For these reasons access to business data is very restricted. Most data collecting agencies do not offer access to their business data and if they do, the data can usually only be analyzed on the premises of the agency by sworn in researchers after a lengthy application process. Finding ways to simplify and broaden the access to business data for external researchers and the general public is thus a topic of intensive research.
机译:在我们期望以数据为基础的社会中,所有重大决策都将得到基于高质量数据的经验证据的支持,因此必须广泛访问这些数据。但是,广泛的数据访问所带来的好处需要与潜在的披露风险进行权衡。政府机构收集的大多数数据都是在保密保证下收集的,这些机构有法律和道义上的义务来保证这一保证。此外,如果受访者觉得他们的数据没有得到充分保护,他们可能会拒绝参与或故意提供错误的答案,从而危害所收集数据的质量。因此,统计机构必须解决这一折衷问题,并且在过去的几十年中已经取得了很大的进步,增加了可供公众使用的数据量,同时又保持了调查受访者的机密性。但是,对于某些类型的数据,要解决这种折衷特别困难。包含有关健康状况敏感信息的医疗记录是一个示例,另一个示例是业务数据。这些数据特别难以保护,因为通常只有几个变量足以确定数据中的大型企业。同时,收集到的信息通常很敏感,因为如果其他机构了解其竞争对手的某些属性,它们可能会获得优势。由于这些原因,对业务数据的访问非常受限。大多数数据收集机构不提供对其业务数据的访问权限,如果这样做,则通常只能在漫长的申请过程之后由研究人员宣誓就职,才能在该机构的前提下对数据进行分析。因此,寻找简化和拓宽外部研究人员和普通公众对业务数据的访问的方法成为了深入研究的主题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号