【24h】

Deep Neural Network Ensembles

机译:深神经网络集合

获取原文

摘要

Current deep neural networks suffer from two problems; first, they are hard to interpret, and second, they suffer from overfitting. There have been many attempts to define interpretability in neural networks, but they typically lack causality or generality. A myriad of regularization techniques have been developed to prevent overfitting, and this has driven deep learning to become the hot topic it is today; however, while most regularization techniques are justified empirically and even intuitively, there is not much underlying theory. This paper argues that to extract the features used in neural networks to make decisions, it's important to look at the paths between clusters existing in the hidden spaces of neural networks. These features are of particular interest because they reflect the true decision-making process of the neural network. This analysis is then furthered to present an ensemble algorithm for arbitrary neural networks which has guarantees for test accuracy. Finally, a discussion detailing the aforementioned guarantees is introduced and the implications to neural networks, including an intuitive explanation for all current regularization methods, are presented. The ensemble algorithm has generated state-of-the-art results for Wide-ResNets on CIFAR-10 (top 5 for all models) and has improved test accuracy for all models it has been applied to.
机译:目前的深神经网络遭受了两个问题;首先,他们很难解释,第二,他们遭受过度装备。有许多尝试在神经网络中定义可解释性,但通常缺乏因果关系或普遍性。已经开发出一种无数的正规化技术来防止过度装备,这导致了深入学习成为今天的热门话题;然而,虽然大多数正则化技术经验甚至直观地是合理的,但是没有太大的潜在理论。本文认为,为了提取神经网络中使用的功能来做出决策,重要的是要查看神经网络隐藏空间中存在的集群之间的路径。这些特征特别感兴趣,因为它们反映了神经网络的真正决策过程。然后进一步进一步分析来呈现用于任意神经网络的集合算法,其具有用于测试精度的保证。最后,引入了详细说明上述担保的讨论,并提出了对神经网络的影响,包括对所有当前正则化方法的直观解释。该集合算法为CIFAR-10上的宽元素产生了最先进的结果(适用于所有型号的前5个),并为其应用于其应用的所有型号具有改进的测试精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号