首页> 外国专利> DEEP REINFORCEMENT LEARNING WITH FAST UPDATING RECURRENT NEURAL NETWORKS AND SLOW UPDATING RECURRENT NEURAL NETWORKS

DEEP REINFORCEMENT LEARNING WITH FAST UPDATING RECURRENT NEURAL NETWORKS AND SLOW UPDATING RECURRENT NEURAL NETWORKS

机译：深度加强学习，快速更新经常性神经网络和缓慢更新经常性神经网络

页面导航

摘要
著录项
相似文献

摘要

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes selecting an action to be performed by the agent using both a slow updating recurrent neural network and a fast updating recurrent neural network that receives a fast updating input that includes the hidden state of the slow updating recurrent neural network.

机译：方法，系统和设备，包括在计算机存储介质上编码的计算机程序，用于加强学习。其中一个方法包括使用慢速更新经常性神经网络选择由代理执行的动作，以及快速更新的复发性神经网络，其接收到包括慢速更新经常性神经网络的隐藏状态的快速更新输入。

著录项

公开/公告号US2021097373A1

专利类型
公开/公告日2021-04-01

原文格式PDF
申请/专利权人 DEEPMIND TECHNOLOGIES LIMITED;
展开▼

申请/专利号US202017121679
发明设计人 IAIN ROBERT DUNNING;WOJCIECH CZARNECKI;MAXWELL ELLIOT JADERBERG;
展开▼

申请日2020-12-14
分类号G06N3/04;G06F17/18;G06N3/08;
国家 US
入库时间 2022-08-24 18:01:21

相似文献

专利
外文文献
中文文献