...
机译:off-policy交错<内联 - 公式>
Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110819 Liaoning Peoples R China|Liaoning Shihua Univ Sch Informat & Control Engn Fushun 113001 Peoples R China;
Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110819 Liaoning Peoples R China|Northeastern Univ Int Joint Res Lab Integrated Automat Shenyang 110819 Liaoning Peoples R China;
Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110819 Liaoning Peoples R China|Northeastern Univ Int Joint Res Lab Integrated Automat Shenyang 110819 Liaoning Peoples R China|Univ Texas Arlington UTA Res Inst Arlington TX 76118 USA;
Univ Manchester Sch Elect & Elect Engn Manchester M13 9PL Lancs England;
Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110819 Liaoning Peoples R China|Northeastern Univ Int Joint Res Lab Integrated Automat Shenyang 110819 Liaoning Peoples R China;
Affine nonlinear systems; interleaved learning; off-policy learning; optimal control; Q-learning;
机译:非策略交错的
机译:高度双折射且非线性的AsSe
机译:加权-
机译:离散仿射非线性系统基于深度强化学习的有限视野最优控制
机译:不确定系统的最优跟踪控制:基于策略和基于策略的强化学习方法
机译:具有信道衰落扇区非线性以及随机出现的间隔延迟和非线性的离散时间系统的模糊...公式输出反馈控制
机译:一个高度双折射和非线性ASSE