International Conference on Computational Linguistics

Asynchronous Parallel Learning for Neural Networks and Structured Models with Dense Features



Abstract

Existing asynchronous parallel learning methods are designed only for sparse feature models, and they face new challenges on dense feature models such as neural networks (e.g., LSTM, RNN). The problem with dense features is that asynchronous parallel learning introduces gradient errors arising from overwrite actions. We show that such gradient errors are very common and inevitable. Nevertheless, our theoretical analysis shows that the learning process with gradient errors can still converge towards the optimum of the objective function for many practical applications. Thus, we propose AsynGrad, a simple method for asynchronous parallel learning with gradient error. Based on various dense feature models (LSTM, dense-CRF) and various NLP tasks, experiments show that AsynGrad achieves a substantial improvement in training speed without any loss in accuracy.
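To illustrate the overwrite problem the abstract refers to, below is a minimal, hypothetical sketch (not the paper's AsynGrad method) of lock-free asynchronous SGD on a shared dense parameter vector: each worker reads the parameters, computes a gradient on a toy least-squares objective, and writes back, so concurrent writers can overwrite each other's updates. All names, hyperparameters, and the toy objective are illustrative assumptions, not taken from the paper.

```python
# Sketch of lock-free asynchronous SGD on dense parameters (assumed setup).
# Concurrent read-compute-write cycles can overwrite each other's updates,
# which is the source of the "gradient errors" discussed in the abstract.
import threading
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 20))               # toy dense features
y = X @ rng.normal(size=20) + 0.01 * rng.normal(size=1000)
w = np.zeros(20)                              # shared dense parameters, no lock

def worker(num_steps, lr=0.05, batch=32):
    global w
    local_rng = np.random.default_rng()
    for _ in range(num_steps):
        idx = local_rng.integers(0, len(X), size=batch)
        w_read = w.copy()                     # possibly stale read of shared parameters
        grad = X[idx].T @ (X[idx] @ w_read - y[idx]) / batch
        # This write-back may overwrite updates made concurrently by other
        # workers; those lost updates are the gradient errors from overwrites.
        w = w_read - lr * grad

threads = [threading.Thread(target=worker, args=(500,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print("final mean squared error:", np.mean((X @ w - y) ** 2))
```

In this sketch training still reaches a low error despite the overwrites, in line with the abstract's claim that learning with gradient errors can remain convergent in practice.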
