首页> 外国专利> USING CORRECTIONS, OF PREDICTED TEXTUAL SEGMENTS OF SPOKEN UTTERANCES, FOR TRAINING OF ON-DEVICE SPEECH RECOGNITION MODEL

USING CORRECTIONS, OF PREDICTED TEXTUAL SEGMENTS OF SPOKEN UTTERANCES, FOR TRAINING OF ON-DEVICE SPEECH RECOGNITION MODEL

机译:使用校正,用于说话的口语识别模型的训练的预测文本段

摘要

Processor(s) of a client device can: receive audio data that captures a spoken utterance of a user of the client device; process, using an on-device speech recognition model, the audio data to generate a predicted textual segment that is a prediction of the spoken utterance; cause at least part of the predicted textual segment to be rendered (e.g., visually and/or audibly); receive further user interface input that is a correction of the predicted textual segment to an alternate textual segment; and generate a gradient based on comparing at least part of the predicted output to ground truth output that corresponds to the alternate textual segment. The gradient is used, by processor(s) of the client device, to update weights of the on-device speech recognition model and/or is transmitted to a remote system for use in remote updating of global weights of a global speech recognition model.
机译:客户端设备的处理器可以:接收捕获客户端设备的用户口语的音频数据;处理,使用On-Device语音识别模型,音频数据来生成预测的文本段,这是口语话语的预测;导致至少部分预测的文本段呈现(例如,视觉和/或可听);接收进一步的用户界面输入,该输入是将预测的文本段校正到备用文本段;并基于将预测的输出的至少一部分进行比较地生成梯度,以对应于备用文本段对应的地面真理输出。通过客户端设备的处理器使用梯度,以更新在设备语音识别模型的权重和/或被发送到远程系统,以用于全局语音识别模型的全局权重的远程更新。

著录项

  • 公开/公告号WO2021045793A1

    专利类型

  • 公开/公告日2021-03-11

    原文格式PDF

  • 申请/专利权人 GOOGLE LLC;

    申请/专利号WO2019US55901

  • 申请日2019-10-11

  • 分类号G10L15/065;

  • 国家 US

  • 入库时间 2022-08-24 17:41:14

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号