首页> 外文期刊>South African Computer Journal >Benign interpolation of noise in deep learning
【24h】

Benign interpolation of noise in deep learning

机译:深度学习中噪声的良性插值

获取原文
           

摘要

The understanding of generalisation in machine learning is in a state of flux, in part due to the ability of deep learning models to interpolate noisy training data and still perform appropriately on out-of-sample data, thereby contradicting long-held intuitions about the bias-variance tradeoff in learning. We expand upon relevant existing work by discussing local attributes of neural network training within the context of a relatively simple framework. We describe how various types of noise can be compensated for within the proposed framework in order to allow the deep learning model to generalise in spite of interpolating spurious function descriptors. Empirically, we support our postulates with experiments involving overparameterised multilayer perceptrons and controlled training data noise. The main insights are that deep learning models are optimised for training data modularly, with different regions in the function space dedicated to fitting distinct types of sample information. Additionally, we show that models tend to fit uncorrupted samples first. Based on this finding, we propose a conjecture to explain an observed instance of the epoch-wise double-descent phenomenon. Our findings suggest that the notion of model capacity needs to be modified to consider the distributed way training data is fitted across sub-units. Author Biographies North-West University, South Africa Electric, Electronic & Computer Engineering department PhD student Researcher.
机译:泛化的机器学习的理解是在不断变化的状态,部分原因是由于深学习模式的插值嘈杂的训练数据,目前仍在继续外的样本数据来适当地执行,从而违背对偏长期持有的直觉能力 - 学习中的诉讼权衡。我们通过在相对简单的框架的背景下讨论神经网络培训的本地属性,扩展了相关工作。我们描述了如何在所提出的框架内补偿各种类型的噪声,以便尽管插入虚假功能描述符,允许深度学习模型概括。经验上,我们支持我们的假设,与涉及过度分辨的多层的多层感知和控制训练数据噪声的实验。主要见解是,深度学习模型是模块化的培训数据的优化,其中功能空间中的不同区域专用于拟合不同类型的样本信息。此外,我们表明模型倾向于首先拟合未损坏的样本。基于这一发现,我们提出了一种猜想来解释观察到的巨头双滴滴现象的例子。我们的研究结果表明,需要修改模型容量的概念,以考虑培训数据跨子单元安装的分布式方式。作者传记西北大学,南非电气,电子电脑工程系博士生研究员。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号