首页> 外国专利> Attention-based sequence transduction neural networks

Attention-based sequence transduction neural networks

机译:基于关注的序列转换神经网络

摘要

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating an output sequence from an input sequence. In one aspect, one of the systems includes an encoder neural network configured to receive the input sequence and generate encoded representations of the network inputs, the encoder neural network comprising a sequence of one or more encoder subnetworks, each encoder subnetwork configured to receive a respective encoder subnetwork input for each of the input positions and to generate a respective subnetwork output for each of the input positions, and each encoder subnetwork comprising: an encoder self-attention sub-layer that is configured to receive the subnetwork input for each of the input positions and, for each particular input position in the input order: apply an attention mechanism over the encoder subnetwork inputs using one or more queries derived from the encoder subnetwork input at the particular input position.
机译:方法,系统和设备,包括在计算机存储介质上编码的计算机程序,用于从输入序列生成输出序列。在一个方面,其中一个系统包括编码器神经网络,被配置为接收输入序列并生成网络输入的编码表示,编码器神经网络包括一个或多个编码器子网的序列,每个编码器子网被配置为接收相应的编码器子网输入每个输入位置,并为每个输入位置生成相应的子网输出,以及每个编码器子网,包括:编码器自我关注子层,其被配置为接收每个输入的子网输入位置和输入顺序中的每个特定输入位置:使用从特定输入位置处输入的编码器子网输入的一个或多个查询应用于编码器子网输入上的注意机制。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号