Hybrid Microdata via Model-Based Clustering

机译：通过基于模型的聚类混合微数据

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we propose a new scheme for statistical disclosure limitation which can be classified as a hybrid method of protection, that is, a method that combines properties of perturbative and synthetic methods. This approach is based on model-based clustering with the subsequent synthesis of the records within each cluster. The novelty is that the clustering and synthesis methods have been carefully chosen to fit each other in view of reducing information loss. The model-based clustering tries to obtain clusters such that the within-cluster data distribution is approximately normal; then we can use a multivariate normal synthesizer for the local synthesis of data. In this way, some of the non-normal characteristics of the data are captured by the clustering, so that a simple synthesizer for normal data can be used within each cluster. Our method is shown to be effective when compared to other disclosure limitation strategies.

机译：在本文中，我们提出了一种统计披露限制的新方案，可以将其归类为一种混合保护方法，即一种将摄动和合成方法的性质相结合的方法。该方法基于基于模型的聚类，随后对每个聚类中的记录进行综合。新颖之处在于，考虑到减少信息丢失，已经仔细选择了聚类和合成方法以相互适应。基于模型的聚类尝试获取聚类，以使聚类内数据分布近似正态。那么我们可以使用多元正态合成器进行数据的本地合成。这样，通过聚类捕获了数据的某些非正常特性，因此可以在每个聚类中使用简单的用于正常数据的合成器。与其他披露限制策略相比，我们的方法被证明是有效的。

著录项

来源
《Privacy in statistical databases》|2012年|103-115|共13页
会议地点 Palermo(IT)
作者
Anna Oganian; Josep Domingo-Ferrer;
展开▼
作者单位

Georgia, Southern University Department of Mathematical Sciences P.O. Box 8093, Statesboro, GA 30460-8093, U.S.A;

Universitat Rovira i Virgili UNESCO Chair in Data Privacy Department of Computer Engineering and Maths Av. Pai'sos Catalans 26, E-43007 Tarragona, Catalonia;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Statistical disclosure limitation (SDL); hybrid SDL methods; mixture models; model-based clustering; expectation-maximization (EM) algorithm;

机译：统计披露限制（SDL）；混合SDL方法混合模型基于模型的聚类；期望最大化（EM）算法;

相似文献

外文文献
中文文献
专利

1. Spatial and non-spatial model-based protection procedures for the release of business microdata [J] . LUISA FRANCONI, JULIAN STANDER Statistics and computing . 2003,第4期

机译：基于空间和非空间模型的保护过程，用于发布业务微数据
2. Hybrid DE-EM Algorithm for Gaussian Mixture Model-Based Wireless Channel Multipath Clustering [J] . Li Yupeng, Zhang Jianhua, He Ruisi, International journal of antennas and propagation . 2019,第PTa4期

机译：基于高斯混合模型的无线通道多路径聚类的混合DE-EM算法
3. Hybrid DE-EM Algorithm for Gaussian Mixture Model-Based Wireless Channel Multipath Clustering [J] . Yupeng Li, Jianhua Zhang, Ruisi He, International journal of antennas and propagation . 2019,第1期

机译：基于高斯混合模型的无线通道多路径聚类的混合DE-EM算法
4. Hybrid Microdata via Model-Based Clustering [C] . Anna Oganian, Josep Domingo-Ferrer International Conference on Privacy in Statistical Databases . 2012

机译：通过基于模型的聚类混合微立数据
5. Energy efficient cluster based hybrid MAC protocol with channel dependent optimized intra-cluster scheduling. [D] . Jagannath, Jithin. 2013

机译：基于能源高效集群的混合MAC协议，具有通道相关的优化集群内调度。
6. The Hybrid Synthetic Microdata Platform: A Method for Statistical Disclosure Control [O] . Joël Kuiper, Edwin R. van den Heuvel, Morris A. Swertz -1

机译：混合合成微数据平台：一种统计披露控制方法
7. Phenotypic Variation in Endangered Texas Salamanders: Application of Model-Based Clustering for Identifying Species and Hybrids [O] . Donella M. Strom, Nathan F. Bendik, Dee Ann Chamberlain, 2020

机译：濒危德克萨斯州蝾螈的表型变异：应用模型的聚类鉴定物种和杂种
8. Incremental Model-Based Clustering for Large Datasets With Small Clusters [R] . Fraley, C. , Raftery, A. , Wehrensy, R. 2003

机译：基于增量模型的聚类适用于具有小集群的大型数据集

Hybrid Microdata via Model-Based Clustering

摘要

著录项

相似文献

相关主题

期刊订阅