...
机译:一种新型的多步Q学习方法,可提高深度强化学习的数据效率
South China Univ Technol, Coll Automat Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China;
South China Univ Technol, Coll Automat Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China;
South China Univ Technol, Coll Automat Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China;
Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R China;
South China Univ Technol, Coll Automat Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China;
South China Univ Technol, Coll Automat Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China;
South China Univ Technol, Coll Automat Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China;
South China Univ Technol, Coll Automat Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China;
Deep reinforcement learning; Robotics; Multi-step methods; Data efficiency;
机译:一种新型多步Q学习方法,提高深增强学习数据效率
机译:SMARTFCT:提高具有深度增强学习的数据中心网络的功率效率
机译:使用政策梯度优化和Q-Learning避免深增强学习碰撞
机译:深度强化学习:从Q学习到深度Q学习
机译:关于游戏的深度加固学习:多重政策头部深度Q学的泛化
机译:通过使用深度强化学习进行患者调度来提高急诊科效率
机译:多步法对深增强学学习高估的影响