首页> 外文期刊>International journal of machine learning and cybernetics >MS-NET: modular selective network Round robin based modular neural network architecture with limited redundancy
【24h】

MS-NET: modular selective network Round robin based modular neural network architecture with limited redundancy

机译:MS-Net:模块化选择性网络循环基于Robin的模块化神经网络架构,冗余有限

获取原文
获取原文并翻译 | 示例
       

摘要

We propose a modular architecture of Deep Neural Network (DNN) for multi-class classification task. The architecture consists of two parts, a router network and a set of expert networks. In this architecture, for a C-class classification problem, we have exactly C experts. The backbone network for these experts and the router are built with simple and identical DNN architecture. For each class, the modular network has a certain number rho of expert networks specializing in that particular class, where rho is called the redundancy rate in this study. We demonstrate that rho plays a vital role in the performance of the network. Although these experts are light weight and weak learners alone, together they match the performance of more complex DNNs. We train the network in two phase wherein, first the router is trained on the whole set of training data followed by training each expert network enforced by a new stochastic objective function that facilitates alternative training on a small subset of expert data and the whole set of data. This alternative training provides an additional form of regularization and avoids over-fitting the expert network on subset data. During the testing phase, the router dynamically selects a fixed number of experts for further evaluation of the input datum. The modular nature and low parameter requirement of the network makes it very suitable in distributed and low computational environments. Extensive empirical study and theoretical analysis on CIFAR-10, CIFAR-100 and F-MNIST substantiate the effectiveness and efficiency of our proposed modular network.
机译:我们为多级分类任务提出了深度神经网络(DNN)的模块化体系结构。该架构由两个部分,路由器网络和一组专家网络组成。在此架构中,对于C类分类问题,我们完全有C专家。这些专家和路由器的骨干网采用简单且相同的DNN架构构建。对于每个类,模块化网络具有专门从事该研究的特定类的专家网络的一定数量的Rho,其中rho被称为本研究中的冗余率。我们展示RHO在网络性能方面发挥着重要作用。虽然这些专家是轻量重量和弱的学习者,但它们一起与更复杂的DNN的性能相匹配。我们在两阶段训练网络,首先,路由器在整个训练数据上培训,然后通过新的随机目标函数培训强制执行的每个专家网络,这有助于对专家数据和整套的小型子集进行替代培训数据。此替代培训提供了额外的正则化形式,并避免在子集数据上过度拟合专家网络。在测试阶段,路由器动态地选择固定数量的专家,以进一步评估输入数据。网络的模块化性质和低参数要求使其非常适合分布式和低计算环境。 CiFar-10,CiFar-100和F-Mnist的广泛实证研究与理论分析证实了我们所提出的模块化网络的有效性和效率。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号