Laying foundations for effective machine learning in law enforcement. Majura - A labelling schema for child exploitation materials

Dalins Janis; Tyshetskiy Yuriy; Wilson Campbell; Carman Mark J.; Boudry Douglas

Abstract

The health impacts of repeated exposure to distressing concepts such as child exploitation materials (CEM, aka 'child pornography') have become a major concern to law enforcement agencies and associated entities. Existing methods for 'flagging' materials largely rely upon prior knowledge, whilst predictive methods are unreliable, particularly when compared with equivalent tools used for detecting 'lawful' pornography. In this paper we detail the design and implementation of a deep-learning based CEM classifier, leveraging existing pornography detection methods to overcome infrastructure and corpora limitations in this field. Specifically, we further existing research through direct access to numerous contemporary, real-world, annotated cases taken from Australian Federal Police holdings, demonstrating the dangers of overfitting due to the influence of individual users' proclivities. We quantify the performance of skin tone analysis in CEM cases, showing it to be of limited use. We assess the performance of our classifier and show it to be sufficient for use in forensic triage and 'early warning' of CEM, but of limited efficacy for categorising against existing scales for measuring child abuse severity.

We identify limitations currently faced by researchers and practitioners in this field, whose restricted access to training material is exacerbated by inconsistent and unsuitable annotation schemas. Whilst adequate for their intended use, we show existing schemas to be unsuitable for training machine learning (ML) models, and introduce a new, flexible, objective, and tested annotation schema specifically designed for cross-jurisdictional collaborative use.

This work, combined with a world-first 'illicit data airlock' project currently under construction, has the potential to bring a 'ground truth' dataset and processing facilities to researchers worldwide without compromising quality, safety, ethics and legality. (C) 2018 Elsevier Ltd. All rights reserved.

Bibliographic record

Source
Digital investigation | 2018, Issue 9 | pp. 40-54 | 15 pages
Authors
Dalins Janis; Tyshetskiy Yuriy; Wilson Campbell; Carman Mark J.; Boudry Douglas
Author affiliations

Australian Fed Police, Melbourne, Vic, Australia;

Australian Fed Police, Barton, ACT, Australia;

CSIRO, Data61, Eveleigh, NSW, Australia;

Monash Univ, Caulfield, Vic, Australia;

Original format: PDF
Text language: English
Keywords
Neural networks; Digital forensics; Child exploitation; Forensic triage; Annotation schema;

Similar literature

Foreign-language literature

1. Seen and Heard (e-Learning Course and Supplementary Training Materials on Building Awareness of Child Sexual Abuse and Exploitation) by the Department of Health and the Children's Society, 2016. Available free: http://learning.seenandheard.org.uk [J]. Eldridge Hilary. Child abuse review ejournal of the British Association for the Study and Prevention of Child Abuse and Neglect. 2018, Issue 2

2. Visual Schemas: Pragmatics of Design Learning in Foundations Studios [J]. Mine Ozkar. Nexus Network Journal. 2011, Issue 1

3. Laying Community Foundations for Your Child With a Disability: How to Establish Relationships That Will Support Your Child After You've Gone [J]. Nicola Schafer. Intellectual and Developmental Disabilities (Mental Retardation). 1997, Issue 6

4. Laying the Foundations of a Learning Platform for Humanitarian Engineering: Methodological Approach and Results [C]. Andrea Mazzurco, Brent K. Jesiek. American Society for Engineering Education Annual Conference and Exposition. 2018

5. The effective use of unmanned aerial vehicles for local law enforcement. [D]. Gasque, Leighton. 2015

6. Exploiting the Dynamics of Soft Materials for Machine Learning [O]. Kohei Nakajima, Helmut Hauser, Tao Li. 2018

7. Cluster Dynamics: Laying the Foundations for Tailoring the Design of Cluster Assembled Nanoscale Materials [R]. Castleman, J. A. 2009

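Chinese-language literature

1. How the "good child" label misleads children's development, and how to move beyond it: reflections on the "good child" labelling phenomenon in early childhood education [J]. 庞清蜻, 甘剑梅. 教育导刊（下半月）. 2017, Issue 009

2. Design and implementation of an intelligent credit labelling service based on a data middle-platform architecture [J]. 金斌. 信息与电脑. 2020, Issue 011

3. Special self-adhesive materials: a strong force in the label materials family [J]. 贾卫华. 标签技术. 2012, Issue 003

4. Suitable hands-on materials, effective provision strategies: examples of selecting and using hands-on materials in kindergarten science activities [J]. 冯德菲. 教育观察. 2019, Issue 003

5. Research on applying data labelling technology in traffic emergency response, management and law enforcement scenarios [J]. 张健, 陈振宇. 电脑知识与技术. 2020, Issue 033

6. Issues that health law enforcement agencies should note when investigating food labelling violations [C]. 刘颖, 许泽春. 首届全国卫生法规、标准效益评价技术研讨会. 2000

7. Research and implementation of custom tags based on a three-tier architecture in a labour and employment system [A]. 杨晓光. 2005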

Patents

1. Method, device and medium for hierarchical training in a machine learning architecture [P]. Chinese patent: CN105683944B. 2019-08-09

2. Medical image object detection with a dense feature pyramid network architecture in machine learning [P]. Chinese patent: CN109753866A. 2019-05-14

3. ENGLISH LEARNING MATERIAL SCHEMATIZING TRANSITIVE VERBS OF ENGLISH WITH THREE CONCEPTION AND ENGLISH LEARNING METHOD USING SAME [P]. Foreign patent: KR20190087791A. 2019-07-25

4. MACHINE FOR LAYING LABELS ON ARTICLES, PARTICULARLY BOTTLES, WITH PRINTING INCORPORATING LABELS, PRECEDING LAYING [P]. Foreign patent: FR2441548A1. 1980-06-13

5. In the label pasting machine, as the cartridge label the label positioner null telescopic label in order it registers to lay out in the height which is beforehand configurated from [P]. Foreign patent: JP4703639B2. 2011-06-15
