Enhanced Privacy and Data Protection using Natural Language Processing and Artificial Intelligence

Abstract

Artificial Intelligence systems have brought significant benefits to users and society, but as the volume of data feeding them keeps growing, so does the exposure to privacy and security leaks. Severe threats to the right to privacy have led governments to enact specific regulations to ensure privacy preservation in any kind of transaction involving sensitive information. For digital and/or physical documents containing sensitive information, the right to privacy can be preserved through data obfuscation procedures. Recognizing the sensitive information to obfuscate is typically entrusted to the experience of human experts, who are overwhelmed by the ever-increasing number of documents to process. Artificial intelligence could effectively reduce the effort required of human officers and speed up processes. However, until enough knowledge is available in a machine-readable format, effective automatic systems cannot be developed. In this work we propose a methodology for transferring and leveraging general knowledge across domain-specific tasks. We built, from scratch, domain-specific knowledge data sets for training artificial intelligence models that support human experts in privacy-preserving tasks. We applied a mixture of natural language processing techniques to unlabeled domain-specific document corpora to automatically obtain labeled documents in which sensitive information is recognized and tagged. We performed preliminary tests on just over 10,000 documents from the healthcare and justice domains. Human experts supported us during validation. The results, measured in terms of precision, recall, and F1-score across these two domains, were promising and encouraged further investigation.
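The abstract does not specify which natural language processing techniques were combined or how the scores were computed. As a rough illustration only, the Python sketch below tags sensitive spans in unlabeled text with two regular expressions, obfuscates them, and scores the tags against a reference annotation using precision, recall, and F1. The patterns, the label names (`DATE`, `EMAIL`), and the function names are assumptions made for this example and are not taken from the paper.

```python
import re

# Hypothetical patterns, chosen only for illustration; the paper's actual
# techniques are not described in the abstract.
PATTERNS = {
    "DATE": re.compile(r"\b\d{1,2}/\d{1,2}/\d{4}\b"),
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
}


def tag_sensitive(text):
    """Return a sorted list of (start, end, label) spans matched by PATTERNS."""
    spans = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            spans.append((match.start(), match.end(), label))
    return sorted(spans)


def obfuscate(text, spans):
    """Replace each span with its bracketed label, working right to left
    so that earlier offsets stay valid."""
    for start, end, label in sorted(spans, reverse=True):
        text = text[:start] + "[" + label + "]" + text[end:]
    return text


def precision_recall_f1(predicted, gold):
    """Score predicted spans against gold spans using exact span matching."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    sample = "Seen on 12/03/2019; contact mario.rossi@example.org."
    found = tag_sensitive(sample)
    print(obfuscate(sample, found))
    # A made-up reference annotation containing one span the rules miss.
    reference = set(found) | {(0, 4, "NAME")}
    print(precision_recall_f1(found, reference))
```

Exact span matching is the strictest scoring choice; a partial-overlap criterion would give more lenient figures, and the abstract does not say which one the authors used.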

Bibliographic details

Source: International Joint Conference on Neural Networks, 2020, pp. 1-8 (8 pages)
Authors: Fabio Martinelli; Fiammetta Marulli; Francesco Mercaldo; Stefano Marrone; Antonella Santone
Original format: PDF
Keywords: Machine learning; Natural language processing; Data protection; Privacy; Medical services; Task analysis
