摘要:Types of RL algorithms:Value Based,Learnt Value Function,Implicit policy(e.g.∈一greedy);Policy Based,No Value Function,Learnt Policy;Actor-Critic,Learnt Value Function,Learnt Policy.Other(multiagent)RL application scenarios:Auto-driving scenarios,Software Testing(e.g.,Fuzzing,code summarization),Data Mining(Feature Engineering),Cyber-Physical Systems(security check),NLP(image to caption,dialogue generation),Multi-robotic systems,Mililitary Scenarios.