MS-NET: modular selective network Round robin based modular neural network architecture with limited redundancy

Chowdhury Intisar Md; Su Kai; Zhao Qiangfu

首页> 外文期刊>International journal of machine learning and cybernetics >MS-NET: modular selective network Round robin based modular neural network architecture with limited redundancy

【24h】

MS-NET: modular selective network Round robin based modular neural network architecture with limited redundancy

机译：MS-Net：模块化选择性网络循环基于Robin的模块化神经网络架构，冗余有限

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相关主题

摘要

We propose a modular architecture of Deep Neural Network (DNN) for multi-class classification task. The architecture consists of two parts, a router network and a set of expert networks. In this architecture, for a C-class classification problem, we have exactly C experts. The backbone network for these experts and the router are built with simple and identical DNN architecture. For each class, the modular network has a certain number rho of expert networks specializing in that particular class, where rho is called the redundancy rate in this study. We demonstrate that rho plays a vital role in the performance of the network. Although these experts are light weight and weak learners alone, together they match the performance of more complex DNNs. We train the network in two phase wherein, first the router is trained on the whole set of training data followed by training each expert network enforced by a new stochastic objective function that facilitates alternative training on a small subset of expert data and the whole set of data. This alternative training provides an additional form of regularization and avoids over-fitting the expert network on subset data. During the testing phase, the router dynamically selects a fixed number of experts for further evaluation of the input datum. The modular nature and low parameter requirement of the network makes it very suitable in distributed and low computational environments. Extensive empirical study and theoretical analysis on CIFAR-10, CIFAR-100 and F-MNIST substantiate the effectiveness and efficiency of our proposed modular network.

机译：我们为多级分类任务提出了深度神经网络（DNN）的模块化体系结构。该架构由两个部分，路由器网络和一组专家网络组成。在此架构中，对于C类分类问题，我们完全有C专家。这些专家和路由器的骨干网采用简单且相同的DNN架构构建。对于每个类，模块化网络具有专门从事该研究的特定类的专家网络的一定数量的Rho，其中rho被称为本研究中的冗余率。我们展示RHO在网络性能方面发挥着重要作用。虽然这些专家是轻量重量和弱的学习者，但它们一起与更复杂的DNN的性能相匹配。我们在两阶段训练网络，首先，路由器在整个训练数据上培训，然后通过新的随机目标函数培训强制执行的每个专家网络，这有助于对专家数据和整套的小型子集进行替代培训数据。此替代培训提供了额外的正则化形式，并避免在子集数据上过度拟合专家网络。在测试阶段，路由器动态地选择固定数量的专家，以进一步评估输入数据。网络的模块化性质和低参数要求使其非常适合分布式和低计算环境。 CiFar-10，CiFar-100和F-Mnist的广泛实证研究与理论分析证实了我们所提出的模块化网络的有效性和效率。

著录项

来源
《International journal of machine learning and cybernetics》 |2021年第3期|763-781|共19页
作者
Chowdhury Intisar Md; Su Kai; Zhao Qiangfu;
展开▼
作者单位

Univ Aizu Syst Intelligence Lab Aizu Wakamatsu Fukushima 9658580 Japan;

Univ Aizu Syst Intelligence Lab Aizu Wakamatsu Fukushima 9658580 Japan;

Univ Aizu Syst Intelligence Lab Aizu Wakamatsu Fukushima 9658580 Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Modular neural networks; Deep learning; Knowledge-distillation; Multi-class classification; Image classification;

机译：模块化神经网络;深度学习;知识蒸馏;多级分类;图像分类;
入库时间 2022-08-19 01:19:53

MS-NET: modular selective network Round robin based modular neural network architecture with limited redundancy

摘要

著录项

相关主题

期刊订阅