首页> 外文期刊>International journal of information security and privacy >Privacy Protection in Enterprise Social Networks Using a Hybrid De-Identification System
【24h】

Privacy Protection in Enterprise Social Networks Using a Hybrid De-Identification System

机译:利用混合去识别系统的企业社交网络隐私保护

获取原文
获取原文并翻译 | 示例
           

摘要

Enterprise social networks (ESN) have been widely used within organizations as a communication infrastructure that allows employees to collaborate with each other and share files and documents. The shared documents may contain a large amount of sensitive information that affect the privacy of persons such as phone numbers, which must be protected against any kind of disclosure or unauthorized access. In this study, authors propose a hybrid de-identification system that extract sensitive information from textual documents shared in ESNs. The system is based on both machine learning and rule-based classifiers. Gradient boosted trees (GBTs) algorithm is used as machine learning classifier. Experiments ran on a modified CoNLL 2003 dataset show that GBTs algorithm achieve a very high F1-score (95%). Additionally, the rule-based classifier is consisted of regular expression and gazetteers in order to complement the machine learning classifier. Thereafter, the sensitive information extracted by the two classifiers are merged and encrypted using Format Preserving Encryption method.
机译:企业社交网络(ESN)已广泛使用组织内作为通信基础架构,允许员工互相协作并共享文件和文档。共享文件可能包含大量影响电话号码(如电话号码)的隐私的敏感信息,这必须免受任何类型的披露或未授权的访问。在这项研究中,作者提出了一个混合去识别系统,其从ESN中共享的文本文档提取敏感信息。该系统基于机器学习和基于规则的分类器。梯度提升树(GBT)算法用作机器学习分类器。实验在修改的Conll 2003数据集上运行显示,GBT算法实现了非常高的F1分数(95%)。另外,基于规则的分类器由正则表达式和缩进者组成,以便补充机器学习分类器。此后,使用格式保留加密方法来合并由两个分类器提取的敏感信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号