首页> 外文会议>IEEE Canadian Conference on Electrical and Computer Engineering >A content-based data masking technique for a built-in framework in Business Intelligence platform
【24h】

A content-based data masking technique for a built-in framework in Business Intelligence platform

机译:商业智能平台中内置框架的基于内容的数据屏蔽技术

获取原文
获取外文期刊封面目录资料

摘要

The implementation of a Business Intelligence (BI) platform in different types of organizations including healthcare, has become an important project. One common implementation is to extract sensitive data from production databases and then load them into an enterprise Data Warehouse (DW). However, the internal privacy breach that occurs when developers, researchers, and testers access the DW is a serious threat that should be taken into consideration. Reversible data masking techniques are used to de-identify the sensitive data, preserve the data format, and maintain the quality of data utility (data analytics). In this paper a practical built-in data masking framework (IMETUIdentify, Map, Execute, Test, and Utilize) is proposed focusing on the execution and testing modules. The proposed data masking algorithm is derived from the statistical content of the extracted dataset, which is grouped at certain levels (micro-aggregation) that are associated with a numeric attribute. The combination of the related statistical information will be used within a mathematical formula to generate the new masked value, and then the statistical variables will be put together in such a sequence and encapsulated to form a strong pair of public and secret keys. This strengthens the security factor while introducing small overheads in performance and space in comparison with native encryption techniques.
机译:在包括医疗保健在内的不同类型的组织中实施商业智能(BI)平台已成为一项重要的项目。一种常见的实现是从生产数据库中提取敏感数据,然后将它们加载到企业数据仓库(DW)中。但是,当开发人员,研究人员和测试人员访问DW时发生的内部隐私泄露是一个严重的威胁,应予以考虑。可逆数据屏蔽技术用于识别敏感数据,保留数据格式并维护数据实用程序(数据分析)的质量。在本文中,针对执行和测试模块,提出了一个实用的内置数据屏蔽框架(IMETUIidentify,Map,Execute,Test和Utilize)。所提出的数据屏蔽算法是从提取的数据集的统计内容中得出的,该统计内容按与数字属性相关联的特定级别(微观汇总)进行分组。相关统计信息的组合将在数学公式内使用,以生成新的掩码值,然后将统计变量按这种顺序放在一起并封装在一起,以形成一对强大的公用密钥和秘密密钥。与本机加密技术相比,这增强了安全性,同时在性能和空间上引入了较小的开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号