Tuning-free step-size adaptation

Abstract

Incremental learning algorithms based on gradient descent are effective and popular in online supervised learning, reinforcement learning, signal processing, and many other application areas. An oft-noted drawback of these algorithms is that they include a step-size parameter that needs to be tuned for best performance, which may require manual intervention and significant domain knowledge or additional data. In many cases, an entire vector of step-size parameters (e.g., one for each input feature) needs to be tuned in order to attain the best performance of the algorithm. To address this, several methods have been proposed for adapting step sizes online. For example, Sutton's IDBD method can find the best vector step size for the LMS algorithm, and Schraudolph's ELK1 method, an extension of IDBD to neural networks, has proven effective on large applications, such as 3D hand tracking. However, to date, all such step-size adaptation methods have included a tunable step-size parameter of their own, which we call the meta-step-size parameter. In this paper we show that the performance of existing step-size adaptation methods is strongly dependent on the choice of their meta-step-size parameter and that their meta-step-size parameter cannot be set reliably in a problem-independent way. We introduce a series of modifications and normalizations to the IDBD method that together eliminate the need to tune the meta-step-size parameter to the particular problem. We show that the resulting overall algorithm, called Autostep, performs as well as or better than the existing step-size adaptation methods on a number of idealized and robot prediction problems and does not require any tuning of its meta-step-size parameter. The ideas behind Autostep are not restricted to the IDBD method, and the same principles are potentially applicable to other incremental learning settings, such as reinforcement learning.
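For context, the following is a minimal sketch of the kind of per-feature step-size adaptation the abstract refers to: Sutton's IDBD update applied to LMS. It is not the Autostep algorithm itself; the variable names and the default value of the meta-step-size theta are illustrative assumptions, and theta is exactly the parameter whose tuning the paper's normalizations aim to eliminate.

    import numpy as np

    def idbd_lms_step(w, beta, h, x, y, theta=0.01):
        # One step of LMS with IDBD step-size adaptation (Sutton, 1992).
        # w: weight vector; beta: log step sizes, one per input feature;
        # h: auxiliary memory trace; theta: the meta-step-size parameter
        # that Autostep aims to make tuning-free.
        delta = y - w @ x                  # prediction error on this example
        beta += theta * delta * x * h      # meta-gradient update of log step sizes
        alpha = np.exp(beta)               # per-feature step sizes (always positive)
        w += alpha * delta * x             # LMS weight update with a vector step size
        # Decay the trace where the step would have overshot, then accumulate.
        h = h * np.maximum(0.0, 1.0 - alpha * x * x) + alpha * delta * x
        return w, beta, h

Calling this repeatedly over a stream of (x, y) examples adapts a separate step size for each input feature. As the abstract notes, the raw IDBD update above still leaves theta to be tuned per problem; Autostep's modifications and normalizations (not shown here) are what remove that dependence.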