Conference on Uncertainty in Artificial Intelligence

Deep Mixture of Experts via Shallow Embedding

Abstract

Larger networks generally have greater representational power at the cost of increased computational complexity. Sparsifying such networks has been an active area of research but has been generally limited to static regularization or dynamic approaches using reinforcement learning. We explore a mixture of experts (MoE) approach to deep dynamic routing, which activates certain experts in the network on a per-example basis. Our novel DeepMoE architecture increases the representational power of standard convolutional networks by adaptively sparsifying and recalibrating channel-wise features in each convolutional layer. We employ a multi-headed sparse gating network to determine the selection and scaling of channels for each input, leveraging exponential combinations of experts within a single convolutional network. Our proposed architecture is evaluated on four benchmark datasets and tasks, and we show that DeepMoEs are able to achieve higher accuracy with lower computation than standard convolutional networks.
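The gating mechanism described in the abstract can be illustrated with a short sketch. Below is a minimal PyTorch-style rendering of the idea, assuming one gating head per convolutional layer fed by a shallow embedding of the input, with a ReLU used to produce sparse, non-negative per-channel gates. All module names, layer sizes, and the ReLU choice are illustrative assumptions, not the authors' released implementation.

# Minimal sketch of per-example channel gating: a shallow embedding of the
# input drives one gating head per conv layer (illustrative, not the authors' code).
import torch
import torch.nn as nn
import torch.nn.functional as F

class GatedConvBlock(nn.Module):
    """Conv layer whose output channels are selected and scaled by input-dependent gates."""
    def __init__(self, in_ch, out_ch, embed_dim):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1)
        self.bn = nn.BatchNorm2d(out_ch)
        self.gate = nn.Linear(embed_dim, out_ch)  # one gating head for this layer

    def forward(self, x, embedding):
        gates = F.relu(self.gate(embedding))            # ReLU keeps gates sparse and non-negative
        out = F.relu(self.bn(self.conv(x)))
        return out * gates.unsqueeze(-1).unsqueeze(-1)  # zero out or rescale channels per example

class DeepMoESketch(nn.Module):
    """Shallow embedding network + gated convolutional stack + linear classifier."""
    def __init__(self, num_classes=10, embed_dim=64):
        super().__init__()
        # Shallow embedding of the raw input, shared by all gating heads.
        self.embedder = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
            nn.Flatten(),
            nn.Linear(32, embed_dim),
        )
        self.blocks = nn.ModuleList([
            GatedConvBlock(3, 64, embed_dim),
            GatedConvBlock(64, 128, embed_dim),
        ])
        self.head = nn.Linear(128, num_classes)

    def forward(self, x):
        e = self.embedder(x)        # per-example shallow embedding
        h = x
        for block in self.blocks:
            h = block(h, e)         # each layer receives its own gates from the embedding
        h = F.adaptive_avg_pool2d(h, 1).flatten(1)
        return self.head(h)

if __name__ == "__main__":
    model = DeepMoESketch()
    logits = model(torch.randn(2, 3, 32, 32))  # e.g. a CIFAR-sized batch
    print(logits.shape)                        # torch.Size([2, 10])

In practice some sparsity pressure on the gate activations (for example an L1 penalty) would be needed to realize the computational savings the abstract claims, since only channels with nonzero gates need to be computed.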
