IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Scalable stacking and learning for building deep architectures



Abstract

Deep Neural Networks (DNNs) have shown remarkable success in pattern recognition tasks. However, parallelizing DNN training across computers has been difficult. We present the Deep Stacking Network (DSN), which overcomes the problem of parallelizing learning algorithms for deep architectures. The DSN provides a method of stacking simple processing modules to build deep architectures, with a convex learning problem in each module. Additional fine tuning further improves the DSN, while introducing minor non-convexity. Full learning in the DSN is batch-mode, making it amenable to parallel training over many machines and thus scalable to potentially very large training sets. Experimental results on both the MNIST (image) and TIMIT (speech) classification tasks demonstrate that the DSN learning algorithm developed in this work is not only parallelizable in implementation but also attains higher classification accuracy than the DNN.
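The stacking scheme the abstract describes can be illustrated with a minimal NumPy sketch. This is an assumption-laden simplification, not the authors' implementation: hidden weights `W` are left at random initialization (the paper instead initializes them, e.g., from pretraining, and later fine-tunes them), so the only learned parameters per module are the output weights `U`, obtained in closed form from the convex ridge-regression problem. Each subsequent module takes the raw input concatenated with the previous module's output. All names (`train_dsn_module`, `dsn_predict`, `reg`, etc.) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(A):
    return 1.0 / (1.0 + np.exp(-A))

def train_dsn_module(Z, T, n_hidden, reg=1e-3):
    """Train one DSN module on inputs Z with targets T.

    W is fixed at random (a simplifying assumption of this sketch),
    so learning U is a convex least-squares problem with the
    closed-form solution U = (H'H + reg*I)^-1 H'T.
    """
    W = rng.standard_normal((Z.shape[1], n_hidden))
    H = sigmoid(Z @ W)                         # hidden representation
    U = np.linalg.solve(H.T @ H + reg * np.eye(n_hidden), H.T @ T)
    return W, U

def train_dsn(X, T, n_modules=3, n_hidden=50):
    """Stack modules: each sees the raw input plus the previous output."""
    modules, Z = [], X
    for _ in range(n_modules):
        W, U = train_dsn_module(Z, T, n_hidden)
        Y = sigmoid(Z @ W) @ U
        modules.append((W, U))
        Z = np.hstack([X, Y])                  # widen input for next module
    return modules

def dsn_predict(X, modules):
    Z, Y = X, None
    for W, U in modules:
        Y = sigmoid(Z @ W) @ U
        Z = np.hstack([X, Y])
    return Y                                   # output of the top module
```

Because each module's learning reduces to accumulating the sufficient statistics `H.T @ H` and `H.T @ T`, which are sums over training examples, the batch-mode computation can be partitioned across machines and the partial sums combined, which is the property the abstract appeals to for scalability.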
