首页> 外国专利> METHOD AND DEVICE FOR VECTORIZING TRANSLATOR'S TRANSLATION PERSONALITY CHARACTERISTICS

METHOD AND DEVICE FOR VECTORIZING TRANSLATOR'S TRANSLATION PERSONALITY CHARACTERISTICS

机译:保护翻译者翻译人格特征的方法和装置

摘要

The embodiments of the present disclosure provide a method and a device for vectorizing a translator's translation personality characteristics. The method comprises: constructing a bilingual sample set on the basis of historical translation corpus of a plurality of translators; inputting the bilingual sample set into a word vector model, and outputting word vectors; inputting each pair of bilingual corpus samples of the bilingual corpus sample set and the corresponding word vectors into an LSTM network-based encoder model and decoder model to perform training; and inputting the bilingual corpus set of each translator into the trained LSTM network to perform training, keeping the parameters of the encoder model unchanged, and obtaining the LSTM network corresponding to said translator; and generating a translator vector on the basis of the parameters of the decoder models in the LSTM network corresponding to said translator. According to the embodiments of the present disclosure, there is no need to filter the personality characteristics of a translator, and there is also no need to manually annotate sample data, so that the training cost is low and the accuracy is high, and the obtained translator vector can accurately and objectively reflect the translation personality characteristics of the translator.
机译:本公开的实施例提供了一种用于向量化翻译者的翻译个性特征的方法和设备。该方法包括:基于多个翻译者的历史翻译语料库构建双语样本集;以及将双语样本集输入词向量模型,并输出词向量;将双语语料样本集的每对双语语料样本和对应的词向量输入到基于LSTM网络的编码器模型和解码器模型中进行训练;将每个翻译器的双语语料集输入训练后的LSTM网络进行训练,保持编码器模型的参数不变,得到与该翻译器相对应的LSTM网络;根据LSTM网络中与所述翻译器对应的解码器模型的参数,生成翻译器矢量。根据本发明实施例,不需要对翻译人员的个性特征进行过滤,也不需要人工注释样本数据,从而训练成本低,准确性高,所得到的译者向量可以准确,客观地反映译者的翻译个性特征。

著录项

  • 公开/公告号WO2020124674A1

    专利类型

  • 公开/公告日2020-06-25

    原文格式PDF

  • 申请/专利权人 IOL (WUHAN) INFORMATION TECHNOLOGY CO. LTD.;

    申请/专利号WO2018CN124915

  • 发明设计人 ZHANG MU;

    申请日2018-12-28

  • 分类号G06F17/27;G06F17/28;

  • 国家 WO

  • 入库时间 2022-08-21 11:10:37

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号