首页> 外文会议>International Conference on Developments in eSystems Engineering >Non-Random Weight Initialisation in Deep Convolutional Networks Applied to Safety Critical Artificial Intelligence
【24h】

Non-Random Weight Initialisation in Deep Convolutional Networks Applied to Safety Critical Artificial Intelligence

机译:深度卷积网络中的非随机重量初始化应用于安全批判人工智能

获取原文

摘要

This paper presents a non-random weight initialisation scheme for convolutional neural network layers. It builds upon previous work that was limited to perceptron layers, but in that work repeatable determinism was achieved with equality in categorisation accuracy between the established random scheme and a linear ramp non-random scheme. This work however, is in Convolutional layers and are the layers that have been responsible for better than human performance in image recognition. The previous perceptron work found that number range was more important rather than the gradient. However, that was due to the fully connected nature of dense layers. Although, in convolutional layers by contrast, there is an order direction implied, and the weights relate to filters rather than image pixel positions, so the weight initialisation is more complex. However, the paper demonstrates a better performance, over the currently established random schemes with convolutional layers. The proposed method also induces earlier learning through the use of striped forms, and as such has less unlearning of the traditionally speckled random forms. That proposed scheme also provides a higher performing accuracy in a single learning session, with improvements of: 3.35% un-shuffled, 2.813% shuffled in the first epoch and 0.521% over the 5 epochs of the model. Of which the first epoch is more relevant as it is the epoch after initialisation. Also the proposed method is repeatable and deterministic, which is also a desirable quality for safety critical applications within image classification. The proposed method is also robust to He initialisation values too, and scored 97.55% accuracy compared to 96.929% accuracy with the Glorot/ Xavier in the traditional random forms, of which the benchmark model was originally optimised with.
机译:本文介绍了卷积神经网络层的非随机重量初始化方案。它在以前的工作中建立在基于Perceptron层的工作,但在该工作中,通过建立的随机方案和线性斜坡非随机方案之间的分类精度等平等实现了可重复的确定性。然而,这项工作是在卷积层中,并且是在图像识别中优于人类性能的负责的层。以前的Perceptron工作发现,数字范围更重要而不是梯度。然而,这是由于致密层的完全连接性质。虽然在卷积层相反,存在暗示的订单方向,并且权重涉及滤波器而不是图像像素位置,因此权重初始化更复杂。然而,本文展示了具有卷积层目前建立的随机方案的更好的性能。所提出的方法还通过使用条纹形式诱导前面的学习,因此,传统上斑点的随机形式的情况下具有较少的无线学习。该拟议方案还在单一学习会议中提供了更高的表现准确性,提高了:3.35%未加工,2.813%在第一个时期洗牌,0.521%在模型的5时代。其中第一个时代更为重要,因为它是初始化后的时期。此外,所提出的方法是可重复的和确定性的,这也是图像分类内安全关键应用的理想质量。该方法的初始化值也是强大的,并且均准确度为97.55%,而具有传统随机形式的Glorot / Xavier的精度为96.929%,其中基准模型最初优化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号