Partial policy iteration ADP algorithm for nonlinear neuro-optimal control with discounted total reward
Guangdong Univ Technol Sch Automat Guangzhou 510006 Peoples R China|Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China;
Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China;
Adaptive critic designs; Adaptive dynamic programming; Policy iteration; Neural networks; Neuro-dynamic programming; Nonlinear systems; Optimal control;