IEEE International Conference on Acoustics, Speech and Signal Processing

Scalable stacking and learning for building deep architectures

Abstract

Deep Neural Networks (DNNs) have shown remarkable success in pattern recognition tasks. However, parallelizing DNN training across computers has been difficult. We present the Deep Stacking Network (DSN), which overcomes the problem of parallelizing learning algorithms for deep architectures. The DSN provides a method of stacking simple processing modules to build deep architectures, with a convex learning problem in each module. Additional fine-tuning further improves the DSN, while introducing minor non-convexity. Full learning in the DSN is batch-mode, making it amenable to parallel training over many machines and thus scalable to potentially huge training sets. Experimental results on both the MNIST (image) and TIMIT (speech) classification tasks demonstrate that the DSN learning algorithm developed in this work is not only parallelizable in implementation but also attains higher classification accuracy than the DNN.
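The stacking idea in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes randomly initialized lower-layer weights `W`, a sigmoid hidden layer, and a ridge-regression closed form for the upper-layer weights `U` (which is what makes each module's learning problem convex); the module count, hidden size, and `ridge` hyperparameter are hypothetical. Each module's input is the raw input concatenated with the previous module's predictions, which is the stacking step.

```python
import numpy as np

def dsn_module(X, T, n_hidden=64, ridge=1e-2, seed=0):
    """One DSN-style module: fixed random lower weights W, then a
    convex (closed-form ridge regression) solve for upper weights U."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((X.shape[1], n_hidden))
    H = 1.0 / (1.0 + np.exp(-X @ W))              # sigmoid hidden activations
    # Convex learning problem: U = argmin_U ||H U - T||^2 + ridge * ||U||^2
    U = np.linalg.solve(H.T @ H + ridge * np.eye(n_hidden), H.T @ T)
    return W, U, H @ U                            # this module's predictions

def stack_dsn(X, T, n_modules=3):
    """Stack modules: each module sees the raw input concatenated
    with the previous module's predictions."""
    inp, preds, modules = X, None, []
    for k in range(n_modules):
        W, U, preds = dsn_module(inp, T, seed=k)
        modules.append((W, U))
        inp = np.hstack([X, preds])               # stacking step
    return modules, preds
```

Because each module's upper weights are obtained from a single batch-mode linear solve rather than iterative gradient descent, the per-module computation can be distributed across machines, which is the scalability property the abstract emphasizes.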