LSTM: A Search Space Odyssey

Abstract

Several variants of the long short-term memory (LSTM) architecture for recurrent neural networks have been proposed since its inception in 1995. In recent years, these networks have become the state-of-the-art models for a variety of machine learning problems. This has led to a renewed interest in understanding the role and utility of various computational components of typical LSTM variants. In this paper, we present the first large-scale analysis of eight LSTM variants on three representative tasks: speech recognition, handwriting recognition, and polyphonic music modeling. The hyperparameters of all LSTM variants for each task were optimized separately using random search, and their importance was assessed using the powerful functional ANalysis Of VAriance (fANOVA) framework. In total, we summarize the results of 5400 experimental runs (≈15 years of CPU time), which makes our study the largest of its kind on LSTM networks. Our results show that none of the variants can improve upon the standard LSTM architecture significantly, and demonstrate the forget gate and the output activation function to be its most critical components. We further observe that the studied hyperparameters are virtually independent and derive guidelines for their efficient adjustment.
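
To make the abstract's terminology concrete, the sketch below shows one forward step of a standard (vanilla) LSTM cell in Python/NumPy, highlighting the forget gate and the tanh output activation that the study identifies as the architecture's most critical components. This is a minimal illustration only: peephole connections, which the paper's vanilla variant also includes, are omitted, and the function names and weight layout are assumptions of this sketch rather than details taken from the paper.

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h_prev, c_prev, W, b):
    """One forward step of a vanilla LSTM cell (peepholes omitted).

    x      : input vector, shape (n_in,)
    h_prev : previous hidden state, shape (n_hid,)
    c_prev : previous cell state, shape (n_hid,)
    W      : stacked gate weights, shape (4 * n_hid, n_in + n_hid)
    b      : stacked gate biases, shape (4 * n_hid,)
    """
    n_hid = h_prev.shape[0]
    z = W @ np.concatenate([x, h_prev]) + b
    i = sigmoid(z[0 * n_hid:1 * n_hid])   # input gate
    f = sigmoid(z[1 * n_hid:2 * n_hid])   # forget gate: scales how much old cell state is kept
    g = np.tanh(z[2 * n_hid:3 * n_hid])   # block input (candidate cell update)
    o = sigmoid(z[3 * n_hid:4 * n_hid])   # output gate
    c = f * c_prev + i * g                # new cell state
    h = o * np.tanh(c)                    # output activation (tanh) applied before the output gate
    return h, c

# Illustrative usage with random weights.
rng = np.random.default_rng(0)
n_in, n_hid = 8, 16
W = 0.1 * rng.standard_normal((4 * n_hid, n_in + n_hid))
b = np.zeros(4 * n_hid)
h, c = lstm_step(rng.standard_normal(n_in), np.zeros(n_hid), np.zeros(n_hid), W, b)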
