首页> 外国专利> PUNCTUATION MARK DELETE MODEL TRAINING DEVICE, PUNCTUATION MARK DELETE MODEL, AND DETERMINATION DEVICE

PUNCTUATION MARK DELETE MODEL TRAINING DEVICE, PUNCTUATION MARK DELETE MODEL, AND DETERMINATION DEVICE

机译:标点符号删除模型训练设备,标点符号标记删除模型和确定设备

摘要

This punctuation mark delete model training device generates, through machine training, a punctuation mark delete model for determining the propriety of a punctuation mark imparted to text obtained by a speech recognition process, the device comprising: a first training data generation unit which generates, on the basis of a first text corpus composed from the text obtained by the speech recognition process, first training data composed of a pair of an input sentence, which is formed by a punctuation mark, a preamble that is a sentence in which the punctuation mark is imparted to the end of the sentence, and postamble that is a sentence immediately after the punctuation mark, and a label that indicates the propriety of imparting the punctuation mark; and a model training unit which updates parameters of the punctuation mark delete model on the basis of an error between the label and a probability obtained by inputting the input sentence of the first training data to the punctuation mark delete model.
机译:该标号标记删除模型训练设备通过机器训练产生标点符号标记删除模型,用于确定赋予由语音识别处理获得的文本的标点符号的适当性,该设备包括:第一训练数据生成单元,该第一训练数据生成单元由语音识别过程获得的文本组成的第一文本语料库的基础,第一训练数据由一对输入句子组成,该输入句由标点符号形成,该前序字是标点符号的句子赋予句子的末尾,并在标点符号之后的句子的句子,以及指示赋予标点符号的适当关系的标签;和模型训练单元,基于标签与通过将第一训练数据的输入句子输入到标点符号删除模型而获得的概率来更新标点符号删除模型的参数。

著录项

  • 公开/公告号WO2021215262A1

    专利类型

  • 公开/公告日2021-10-28

    原文格式PDF

  • 申请/专利权人 NTT DOCOMO INC.;

    申请/专利号WO2021JP14931

  • 发明设计人 KATOU TAKU;

    申请日2021-04-08

  • 分类号G06F16/33;G10L15/16;G10L15/22;

  • 国家 JP

  • 入库时间 2022-08-24 21:59:45

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号