深度多智能体强化学习

摘要

Types of RL algorithms:Value Based,Learnt Value Function,Implicit policy(e.g.∈一greedy);Policy Based,No Value Function,Learnt Policy;Actor-Critic,Learnt Value Function,Learnt Policy.Other(multiagent)RL application scenarios:Auto-driving scenarios,Software Testing(e.g.,Fuzzing,code summarization),Data Mining(Feature Engineering),Cyber-Physical Systems(security check),NLP(image to caption,dialogue generation),Multi-robotic systems,Mililitary Scenarios.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号