首页> 外国专利> TRAINING SERVER AND METHOD FOR GENERATING A PREDICTIVE MODEL OF A NEURAL NETWORK THROUGH DISTRIBUTED REINFORCEMENT LEARNING

TRAINING SERVER AND METHOD FOR GENERATING A PREDICTIVE MODEL OF A NEURAL NETWORK THROUGH DISTRIBUTED REINFORCEMENT LEARNING

机译：通过分布式增强学习，培训服务器和用于生成神经网络预测模型的方法

页面导航

摘要
著录项
相似文献

摘要

Interactions between a training server and a plurality of environment controllers are used for updating the weights of a predictive model used by a neural network executed by the plurality of environment controllers. Each environment controller executes the neural network using a current version of the predictive model to generate outputs based on inputs, modifies the outputs, and generates metrics representative of the effectiveness of the modified outputs for controlling the environment. The training server collects the inputs, the corresponding modified outputs, and the corresponding metrics from the plurality of environment controllers. The collected inputs, modified outputs and metrics are used by the training server for updating the weights of the current predictive model through reinforcement learning. A new predictive model comprising the updated weights is transmitted to the environment controllers to be used in place of the current predictive model.

机译：训练服务器和多个环境控制器之间的交互用于更新由多个环境控制器执行的神经网络使用的预测模型的权重。每个环境控制器使用当前版本的预测模型执行神经网络，以基于输入生成输出，修改输出，并生成代表修改的输出用于控制环境的有效性的度量。训练服务器从多个环境控制器收集输入，相应的修改输出和相应的度量。训练服务器使用收集的输入，修改的输出和度量来通过加固学习更新当前预测模型的权重。包括更新权重的新预测模型被发送到要使用的环境控制器代替当前预测模型。

著录项

公开/公告号EP3805996A1

专利类型
公开/公告日2021-04-14

原文格式PDF
申请/专利权人 DISTECH CONTROLS INC.;
展开▼

申请/专利号EP20200192189
发明设计人 LUPIEN STEVE;GERVAIS FRANCOIS;
展开▼

申请日2020-08-21
分类号G06N3/08;G05B15/02;G05B13/02;
国家 EP
入库时间 2022-08-24 18:12:33

相似文献

专利
外文文献
中文文献