首页> 外国专利> Neural machine translation systems with rare word processing

Neural machine translation systems with rare word processing

机译:具有罕见文字处理功能的神经机器翻译系统

摘要

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for neural translation systems with rare word processing. One of the methods is a method training a neural network translation system to track the source in source sentences of unknown words in target sentences, in a source language and a target language, respectively and includes deriving alignment data from a parallel corpus, the alignment data identifying, in each pair of source and target language sentences in the parallel corpus, aligned source and target words; annotating the sentences in the parallel corpus according to the alignment data and a rare word model to generate a training dataset of paired source and target language sentences; and training a neural network translation model on the training dataset.
机译:用于具有稀有文字处理的神经翻译系统的方法,系统和装置,包括在计算机存储介质上编码的计算机程序。方法之一是训练神经网络翻译系统以分别以源语言和目标语言跟踪目标句子中未知词的源句子中的源的方法,并且该方法包括从并行语料库中获取对齐数据,该对齐数据在平行语料库的每一对源语言和目标语言句子中,识别对齐的源语言和目标词;根据对齐数据和稀有词模型对平行语料库中的句子进行注释,以生成源语言和目标语言句子配对的训练数据集;在训练数据集上训练神经网络翻译模型。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号