首页>
外国专利>
DETERMINING JOURNALIST RISK OF A DATASET USING POPULATION EQUIVALENCE CLASS DISTRIBUTION ESTIMATION
DETERMINING JOURNALIST RISK OF A DATASET USING POPULATION EQUIVALENCE CLASS DISTRIBUTION ESTIMATION
展开▼
机译:用总体等价类分布估计确定数据集的风险
展开▼
页面导航
摘要
著录项
相似文献
摘要
Methods and systems to de-identify a longitudinal dataset of personal records based on journalistic risk computed from a sample set of the personal records, including determining a similarity distribution of the sample set based on quasi-identifiers of the respective personal records, converting the similarity distribution of the sample set to an equivalence class distribution, and computing journalistic risk based on the equivalence distribution. In an embodiment, multiple similarity measures are determined for a personal record based on comparisons with multiple combinations of other personal records of the sample set, and an average of the multiple similarity measures is rounded. In an embodiment, similarity measures are determined for a subset of the sample set and, for each similarity measure, the number of records having the similarity measure is projected to the subset of personal records. Journalistic risk may be computed for multiple types of attacks.
展开▼