首页> 外文学位 >Statistical tools for disclosure limitation in multi-way contingency tables.

【24h】

Statistical tools for disclosure limitation in multi-way contingency tables.

机译：多向列联表中限制披露的统计工具。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Disseminating information from a k-way cross-classification of non-negative counts n typically corresponds to the release of various lower order marginals, or equivalently subsets of the k variables. This thesis exploits the theory of graphical models to characterize classes of tables

W

induced by several possibly overlapping marginals. Any set of released marginals can be used to define an independence graph. In the special case when these marginals correspond to the minimal sufficient statistics of a decomposable graphical model, the theory yields explicit formulas for sharp upper and lower bounds on the cell entries of tables in

W

. In addition, these bound results are related to the Markov basis used to induce probability distributions over

W

. In the decomposable case, simple “data swaps” or moves are the only moves required to construct a Markov basis that links all the contingency tables in

W

. The approach for computing bounds and for generating Markov bases developed in the thesis generalizes to the case when the released marginals correspond to a reducible independence graph. For an arbitrary set of released marginals, explicit formulas for sharp bounds are not available, and some form of iterative algorithm is required. The method developed in this thesis is a generalization of the shuttle algorithm proposed by Buzzigoli and Giusti. This generalized shuttle algorithm can be modified to enumerate all the tables in the class

W

and to find a controlled rounding of a table of arbitrary dimension. The last part of this thesis studies probability distribution functions defined on spaces of tables induced by a set of marginal totals. Through examples and discussion the thesis illustrates the practical values of the bound and distribution results for assessing the disclosure risk for categorical data.

机译：从非负计数 n 的 k 交叉分类中传播信息通常对应于释放各种较低阶边际或 k的等效子集变量。本文利用图形化模型的理论来刻画由几个可能重叠的边际引起的表

W 的类。可以使用任何一组已发布的边际量来定义独立性图。在特殊情况下，当这些边际对应于可分解图形模型的最小充分统计量时，该理论为 W 。此外，这些绑定结果与用于在 W 上引起概率分布的马尔可夫基础有关。在可分解的情况下，简单的“数据交换”或移动是构造将链接所有 W W 。本文开发的计算边界和生成马尔可夫基数的方法推广到释放边际对应于可约独立图的情况。对于任意一组已发布的边际，没有用于尖锐边界的显式公式，因此需要某种形式的迭代算法。本文提出的方法是对Buzzigoli和Giusti提出的穿梭算法的概括。可以修改此通用的穿梭算法，以枚举 W 类中的所有表，并找到任意维度的表的受控舍入。本文的最后一部分研究了由一组边际总数引发的在表空间上定义的概率分布函数。通过实例和讨论，本文阐述了边界和分布结果对评估分类数据披露风险的实用价值。

著录项

作者
Dobra, Adrian.;
展开▼
作者单位

Carnegie Mellon University.;

展开▼
授予单位 Carnegie Mellon University.;
学科 Information Science.; Sociology Theory and Methods.; Statistics.
学位 Ph.D.
年度 2002
页码 296 p.
总页数 296
原文格式 PDF
正文语种 eng
中图分类信息与知识传播;社会学理论与方法论;统计学;
关键词

相似文献

外文文献
中文文献
专利

1. Is Sharing De-identified Data Legal? The State of Public Health Confidentiality Laws and Their Interplay with Statistical Disclosure Limitation Techniques [J] . Richardson Victor, Milam Sallie, Chrysler Denise The Journal of law, medicine & ethics: a journal of the American Society of Law, Medicine & Ethics . 2015,第1Suppla期

机译：共享去识别数据合法吗？《公共卫生保密法状况》及其与统计披露限制技术的相互影响
2. Economic Analysis and Statistical Disclosure Limitation [J] . Abowd John M., Schmutte Ian M. Brookings Papers on Economic Activity . 2015,第SPRING期

机译：经济分析与统计披露限制
3. Statistical Disclosure Limitation Research and Practice: Fascinating and Growing Areas of Importance [J] . Jerry Reiter Chance . 2012,第1期

机译：统计披露限制研究与实践：引人入胜的重要领域
4. Multivariate Top-Coding for Statistical Disclosure Limitation [C] . Anna Oganian, Ionut Iacob, Goran Lesaja UNESCO chair in data privacy international conference on privacy in statistical databases . 2020

机译：统计披露限制的多元顶部编码
5. Sampling contingency tables given sets of marginals and/or conditionals in the context of statistical disclosure limitation. [D] . Lee, Juyoun. 2009

机译：在统计披露限制的情况下，给定的边际和/或条件集抽样列联表。
6. Statistical tests for 2 X 2 tables. [O] . I D Hill 1985

机译：2 X 2表的统计检验。
7. Algebraic statistics and contingency table problems: Log-linear models, likelihood estimation and disclosure limitation [O] . Adrian Dobra, Stephen E. Fienberg, Alessandro Rinaldo, 2008

机译：代数统计和列联表问题：对数线性模型，似然估计和披露限制
8. Setting an Agenda for Research in the Federal Statistical System: Needs for Statistical Disclosure Limitation Procedures [R] . Cox, L. H., Zayatz, L. V. 1993

机译：制定联邦统计系统研究议程：统计披露限制程序的需要

Statistical tools for disclosure limitation in multi-way contingency tables.

摘要

著录项

相似文献

相关主题

期刊订阅