首页> 外文会议>Privacy in Statistical Databases >Accounting for Intruder Uncertainty Due toSampling When Estimating Identification Disclosure Risks in Partially Synthetic Data

【24h】

Accounting for Intruder Uncertainty Due toSampling When Estimating Identification Disclosure Risks in Partially Synthetic Data

机译：在估计部分合成数据中的标识披露风险时，应考虑抽样导致的入侵者不确定性

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Partially synthetic data comprise the units originally surveyed with some collected values, such as sensitive values at high risk of disclosure or values of key identifiers, replaced with multiple draws from statistical models. Because the original records remain on the file, intruders may be able to link those records to external databases, even though values are synthesized. We illustrate how statistical agencies can evaluate the risks of identification disclosures before releasing such data. We compute risk measures when intruders know who is in the sample and when the intruders do not know who is in the sample. We use classification and regression trees to synthesize data from the U.S. Current Population Survey.

机译：部分合成数据包括最初使用一些收集的值（例如，处于高披露风险的敏感值或关键标识符的值）进行调查的单位，并用统计模型的多次抽取代替。由于原始记录保留在文件中，因此入侵者也许可以将这些记录链接到外部数据库，即使值是合成的。我们将说明统计机构在发布此类数据之前如何评估身份披露的风险。当入侵者知道样本中的人以及入侵者不知道样本中的人时，我们将计算风险度量。我们使用分类树和回归树来综合来自美国当前人口调查的数据。

著录项

来源
《Privacy in Statistical Databases》|2008年|P.227-238|共12页
会议地点 Istanbul(TK);Istanbul(TK)
作者
Joerg Drechsler; Jerome P. Reiter;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP311.13;TP11.13;
关键词
CART; disclosure; risk; synthetic data;

机译：CART;披露;风险;综合数据;
入库时间 2022-08-26 14:22:27

相似文献

外文文献
中文文献
专利

1. Assessing disclosure risks for synthetic data with arbitrary intruder knowledge [J] . David McClure, Jerome P. Reiter Statistical Journal of the IAOS: Journal of the International Association for Official Statistics . 2016,第1期

机译：利用任意入侵者知识评估合成数据的披露风险
2. Generating partially synthetic geocoded public use data with decreased disclosure risk by using differential smoothing [J] . Quick Harrison, Holan Scott H., Wikle Christopher K. Journal of the Royal Statistical Society . 2018,第pta3期

机译：通过使用差分平滑来生成披露风险降低的部分合成地理编码公共用途数据
3. Estimating Risks of Identification Disclosure in Microdata [J] . Jerome P. REITER Journal of the American statistical association . 2005,第472期

机译：估计微数据中身份披露的风险
4. Accounting for Intruder Uncertainty Due to Sampling When Estimating Identification Disclosure Risks in Partially Synthetic Data [C] . Jorg Drechsler, Jerome P. Reiter Data Privacy International COnference . 2008

机译：由于在估算部分合成数据中估算识别披露风险时，因采样而核算入侵者不确定性
5. Quantitative Risk Assessment of Natural and Cut Slopes: Measuring Uncertainty in the Estimated Risks and Proposed Framework for Developing Risk Evaluation Criteria. [D] . Macciotta, Renato. 2013

机译：天然和坡度的定量风险评估：测量估计风险的不确定性以及制定风险评估标准的拟议框架。
6. Disclosure Control using Partially Synthetic Data for Large-Scale Health Surveys with Applications to CanCORS [O] . Bronwyn Loong, Alan M. Zaslavsky, Yulei He, -1

机译：使用部分合成数据进行大规模健康调查的信息披露控制并应用于CanCORS
7. Estimating Risks of Identification Disclosure in Partially Synthetic Data [O] . Reiter, Jerome P, Mitra, Robin 2009

机译：估计部分合成数据中身份披露的风险
8. Accounting for uncertainty in systematic bias in exposure estimates used in relative risk regression [R] . Gilbert, E. S. 1995

机译：考虑相对风险回归中使用的暴露估计的系统偏差的不确定性

Accounting for Intruder Uncertainty Due toSampling When Estimating Identification Disclosure Risks in Partially Synthetic Data

摘要

著录项

相似文献

相关主题

期刊订阅