首页>
外国专利>
PRE-TRAINING SYSTEM FOR SELF-LEARNING AGENT IN VIRTUALIZED ENVIRONMENT
PRE-TRAINING SYSTEM FOR SELF-LEARNING AGENT IN VIRTUALIZED ENVIRONMENT
展开▼
机译:虚拟环境中自学代理的预培训系统
展开▼
页面导航
摘要
著录项
相似文献
摘要
A pre-training apparatus and method for reinforcement learning based on a Generative Adversarial Network (GAN) is provided. GAN includes a generator and a discriminator. The method comprising receiving training data from a real environment where the training data includes a data slice corresponding to a first state-reward pair and a first state-action pair, training the GAN using the training data, training a relations network to extract a latent relationship of the first state-action pair with the first state-reward pair in a reinforcement learning context, causing the generator trained with training data to generate first synthetic data, processing a portion of the first synthetic data in the relations network to generate a resulting data slice, merging the second state-action pair portion of the first synthetic data with the second state-reward pair from the relations network to generate second synthetic data to update a policy for interaction with the real environment.
展开▼