Optimal random perturbations for stochastic approximation using a simultaneous perturbation gradient approximation

Sadegh P.; Spall J.C.

首页> 外文期刊>IEEE Transactions on Automatic Control >Optimal random perturbations for stochastic approximation using a simultaneous perturbation gradient approximation

【24h】

Optimal random perturbations for stochastic approximation using a simultaneous perturbation gradient approximation

机译：使用同时扰动梯度逼近的随机逼近的最佳随机扰动

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The simultaneous perturbation stochastic approximation (SPSA) algorithm has attracted considerable attention for challenging optimization problems where it is difficult or impossible to obtain a direct gradient of the objective (say, loss) function. The approach is based on a highly efficient simultaneous perturbation approximation to the gradient based on loss function measurements. SPSA is based on picking a simultaneous perturbation (random) vector in a Monte Carlo fashion as part of generating the approximation to the gradient. This paper derives the optimal distribution for the Monte Carlo process. The objective is to minimize the mean square error of the estimate. The authors also consider maximization of the likelihood that the estimate be confined within a bounded symmetric region of the true parameter. The optimal distribution for the components of the simultaneous perturbation vector is found to be a symmetric Bernoulli in both cases. The authors end the paper with a numerical study related to the area of experiment design.

机译：同步摄动随机逼近（SPSA）算法已经引起了对具有挑战性的优化问题的关注，这些挑战很难或不可能获得目标函数（例如损失）函数的直接梯度。该方法基于基于损耗函数测量值的梯度的高效同时扰动近似。 SPSA基于以蒙特卡洛方式选择同时扰动（随机）矢量作为生成梯度近似值的一部分。本文推导了蒙特卡洛过程的最优分布。目的是使估计的均方误差最小。作者还考虑了将估计值限制在真实参数的有界对称区域内的可能性的最大化。在这两种情况下，同时摄动矢量的分量的最佳分布是对称的伯努利。作者以与实验设计领域相关的数值研究作为结尾。

著录项

来源
《IEEE Transactions on Automatic Control》 |1998年第10期|P.1480-1484|共5页
作者
Sadegh P.; Spall J.C.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化系统;
关键词

相似文献

外文文献
中文文献
专利

1. Optimal random perturbations for stochastic approximation using asimultaneous perturbation gradient approximation [J] . Sadegh P., Spall J.C. IEEE Transactions on Automatic Control . 1998,第10期

机译：使用随机扰动梯度近似的随机近似最优随机扰动
2. Optimal control of polymer flooding based on simultaneous perturbation stochastic approximation method guided by finite difference gradient [J] . Kang Zhou, Jian Hou, Xiansong Zhang, Computers & Chemical Engineering . 2013,第auga8期

机译：基于有限差分梯度的同时摄动随机逼近方法的聚合物驱最优控制
3. Convergence rate of moments in stochastic approximation with simultaneous perturbation gradient approximation and resetting [J] . Gerencser L. IEEE Transactions on Automatic Control . 1999,第5期

机译：随机逼近与同时扰动梯度逼近和重置的矩的收敛速度
4. Optimal random perturbations for stochastic approximation using a simultaneous perturbation gradient approximation [C] . Sadegh, P., Spall, . 1997

机译：使用同时扰动梯度逼近的随机逼近的最佳随机扰动
5. Calibration of traffic simulation models using simultaneous perturbation stochastic approximation (SPSA) method extended through Bayesian sampling methodology. [D] . Lee, Jung-Beom. 2008

机译：使用同时通过贝叶斯采样方法扩展的同时扰动随机逼近（SPSA）方法对交通仿真模型进行校准。
6. Performance analysis of the simultaneous perturbation stochastic approximation algorithm on the noisy sphere model [O] . Steffen Finck, Hans-Georg Beyer -1

机译：噪声球模型同时摄动随机逼近算法的性能分析。
7. Optimal random perturbations for stochastic approximation using a simultaneous perturbation gradient approximation [O] . Sadegh Payman, Spall J. C. 1998

机译：用同时扰动梯度近似的随机逼近的最优随机扰动
8. Randomized Difference Two-Timescale Simultaneous Perturbation Stochastic Approximation Algorithms for Simulation Optimization of Hidden Markov Models. [R] . Bhatnagar, S., Fu, M. C., Marcus, S. I., 2000

机译：随机差分双时间尺度同时扰动随机逼近算法在隐马尔可夫模型仿真优化中的应用。

Optimal random perturbations for stochastic approximation using a simultaneous perturbation gradient approximation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅