AUGMENTING ATTENTION-BASED NEURAL NETWORKS TO SELECTIVELY ATTEND TO PAST INPUTS
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing a machine learning task on a network input that is a sequence, to generate a network output. In one aspect, one of the methods includes, for each particular network input, for each attention layer in the neural network: maintaining episodic memory data; maintaining compressed memory data; receiving a layer input to be processed by the attention layer; and applying an attention mechanism over (i) the compressed representations in the compressed memory data for the layer, (ii) the hidden states in the episodic memory data for the layer, and (iii) the respective hidden state at each of the plurality of input positions in the particular network input, to generate a respective activation for each input position in the layer input.
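The attention step described above can be sketched as follows. This is a minimal, hedged illustration, not the patented implementation: the function and parameter names (`attend_with_memories`, `episodic_mem`, `compressed_mem`) are hypothetical, and single-head scaled dot-product attention is assumed in place of whatever attention mechanism the claims actually cover. The key idea shown is that each current input position attends over the concatenation of (i) compressed memory, (ii) episodic memory, and (iii) the current hidden states.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attend_with_memories(hidden, episodic_mem, compressed_mem):
    """Illustrative single-head attention over past and current context.

    hidden:         (n, d) hidden states at the current input positions
    episodic_mem:   (m, d) stored hidden states from past segments
    compressed_mem: (c, d) compressed representations of older segments

    Returns one activation per current input position, shape (n, d).
    All names and shapes here are assumptions for illustration.
    """
    # (i) + (ii) + (iii): attend over compressed memory, episodic
    # memory, and the current hidden states jointly.
    context = np.concatenate([compressed_mem, episodic_mem, hidden], axis=0)
    d = hidden.shape[-1]
    scores = hidden @ context.T / np.sqrt(d)   # (n, c + m + n)
    weights = softmax(scores, axis=-1)         # rows sum to 1
    return weights @ context                   # (n, d)
```

In a full model, the episodic memory would hold recent hidden states evicted from the current window, and the compressed memory would hold a reduced-size summary of states evicted from the episodic memory; here both are simply passed in as arrays.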