EXPLORATORY HJB EQUATIONS AND THEIR CONVERGENCE

WENPIN TANG; YUMING PAUL ZHANG; XUN YU ZHOU

首页> 外文期刊>SIAM Journal on Control and Optimization >EXPLORATORY HJB EQUATIONS AND THEIR CONVERGENCE

【24h】

EXPLORATORY HJB EQUATIONS AND THEIR CONVERGENCE

机译：EXPLORATORY HJB EQUATIONS AND THEIR CONVERGENCE

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

We study the exploratory Hamilton-Jacobi-Bellman (HJB) equation arising from the entropy-regularized exploratory control problem, which was formulated by Wang, Zariphopoulou, and Zhou (J. Mach. Learn. Res., 21 (2020), 198) in the context of reinforcement learning in continuous time and space. We establish the well-posedness and regularity of the viscosity solution to the equation, as well as the convergence of the exploratory control problem to the classical stochastic control problem when the level of exploration decays to zero. We then apply the general results obtained to the exploratory temperature control problem, which was introduced by Gao, Xu, and Zhou (SIAM J. Control Optim., 60 (2022), pp. 1250-1268) to design an endogenous temperature schedule for simulated annealing in the context of nonconvex optimization. We derive an explicit rate of convergence for this problem as exploration diminishes to zero, and find that the stationary distribution of the optimally controlled process exists, which is however neither a Dirac mass on the global optimum nor a Gibbs measure.

著录项

来源
《SIAM Journal on Control and Optimization》 |2022年第6期|3191-3216|共26页
作者
WENPIN TANG; YUMING PAUL ZHANG; XUN YU ZHOU;
展开▼
作者单位

Department of Industrial Engineering and Operations Research, Columbia University, New York, NY USA 10027;

Department of Mathematics, University of California, San Diego, CA USA;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类运筹学;控制论、信息论（数学理论）;
关键词
HJB equations; stochastic control; partial differential equations; reinforcement learning; exploratory control; entropy regularization; simulated annealing; overdamped Langevin equation;

EXPLORATORY HJB EQUATIONS AND THEIR CONVERGENCE

摘要

著录项

相关主题

期刊订阅