Conference: International Conference on Availability, Reliability and Security

A k-anonymity method based on search engine query statistics for disaster impact statements

Abstract

Privacy is a major concern in the management of big data, especially for datasets that contain sensitive personal information. Personal information is frequently used in marketing analyses, and it can also be used to assess the damage situation at the time of a disaster. One widely used model for protecting privacy is k-anonymity, which can be generally defined as a clustering method in which any record in a dataset is indistinguishable from at least (k-1) other records in the same dataset. Most approaches to k-anonymity suffer from heavy information loss due to the abstraction of continuous numerical and categorical attributes that have a hierarchical structure. Conventional k-anonymity is difficult to use with actual Internet services because of its computational complexity and the loss of value that stems from the loss of information. In this paper, we propose an anonymization algorithm that can serve both marketing analysis and disaster analysis. In ordinary times, the algorithm lets us analyze personal data using the SEM price; in times of disaster, we ensure anonymity according to the number of times a searched word appears and distribute only the necessary information. This approach makes it possible to compute only the necessary data while maintaining a sufficient level of k-anonymity. Applying the method to actual data showed that using an index based on the number of occurrences of the search term makes it possible to anonymize the information while preferentially partitioning disaster locations.
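The k-anonymity definition used in the abstract can be made concrete with a small check. The sketch below is not the paper's algorithm: it only tests whether a table already satisfies k-anonymity for a chosen set of quasi-identifier columns. The function name `is_k_anonymous`, the field names, and the toy records are all hypothetical and chosen for illustration.

```python
from collections import Counter


def is_k_anonymous(records, quasi_identifiers, k):
    """Return True if every combination of quasi-identifier values
    that appears in `records` is shared by at least k records."""
    # Count how many records fall into each quasi-identifier group.
    groups = Counter(
        tuple(record[q] for q in quasi_identifiers) for record in records
    )
    return all(count >= k for count in groups.values())


# Hypothetical toy records; the fields and values are illustrative only.
records = [
    {"area": "Ward A", "age_band": "30-39", "statement": "roof damaged"},
    {"area": "Ward A", "age_band": "30-39", "statement": "no power"},
    {"area": "Ward B", "age_band": "40-49", "statement": "road flooded"},
]

# The single "Ward B" record forms a group of size 1, so k=2 fails.
print(is_k_anonymous(records, ["area", "age_band"], k=2))

# The first two records share the same quasi-identifiers, so k=2 holds.
print(is_k_anonymous(records[:2], ["area", "age_band"], k=2))
```

An anonymization method would generalize or suppress quasi-identifier values until a check like this passes. According to the abstract, the proposed method uses search-term occurrence counts to decide where finer partitions (disaster locations) are kept.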
