首页> 外文期刊>IEEE Transactions on Automatic Control >Optimal random perturbations for stochastic approximation using a simultaneous perturbation gradient approximation
【24h】

Optimal random perturbations for stochastic approximation using a simultaneous perturbation gradient approximation

机译:使用同时扰动梯度逼近的随机逼近的最佳随机扰动

获取原文
获取原文并翻译 | 示例
       

摘要

The simultaneous perturbation stochastic approximation (SPSA) algorithm has attracted considerable attention for challenging optimization problems where it is difficult or impossible to obtain a direct gradient of the objective (say, loss) function. The approach is based on a highly efficient simultaneous perturbation approximation to the gradient based on loss function measurements. SPSA is based on picking a simultaneous perturbation (random) vector in a Monte Carlo fashion as part of generating the approximation to the gradient. This paper derives the optimal distribution for the Monte Carlo process. The objective is to minimize the mean square error of the estimate. The authors also consider maximization of the likelihood that the estimate be confined within a bounded symmetric region of the true parameter. The optimal distribution for the components of the simultaneous perturbation vector is found to be a symmetric Bernoulli in both cases. The authors end the paper with a numerical study related to the area of experiment design.
机译:同步摄动随机逼近(SPSA)算法已经引起了对具有挑战性的优化问题的关注,这些挑战很难或不可能获得目标函数(例如损失)函数的直接梯度。该方法基于基于损耗函数测量值的梯度的高效同时扰动近似。 SPSA基于以蒙特卡洛方式选择同时扰动(随机)矢量作为生成梯度近似值的一部分。本文推导了蒙特卡洛过程的最优分布。目的是使估计的均方误差最小。作者还考虑了将估计值限制在真实参数的有界对称区域内的可能性的最大化。在这两种情况下,同时摄动矢量的分量的最佳分布是对称的伯努利。作者以与实验设计领域相关的数值研究作为结尾。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号