首页> 外国专利> SYSTEMS AND METHODS FOR PRINCIPLED BIAS REDUCTION IN PRODUCTION SPEECH MODELS

SYSTEMS AND METHODS FOR PRINCIPLED BIAS REDUCTION IN PRODUCTION SPEECH MODELS

机译:生产语音模型中主要偏倚减少的系统和方法

摘要

Described herein are systems and methods to identify and address sources of bias in an end-to-end speech model. In one or more embodiments, the end-to-end model may be a recurrent neural network with two 2D-convolutional input layers, followed by multiple bidirectional recurrent layers and one fully connected layer before a softmax layer. In one or more embodiments, the network is trained end-to-end using the CTC loss function to directly predict sequences of characters from log spectrograms of audio. With optimized recurrent layers and training together with alignment information, some unwanted bias induced by using purely forward only recurrences may be removed in a deployed model.
机译:本文描述了用于识别和解决端到端语音模型中的偏见源的系统和方法。在一个或多个实施例中,端到端模型可以是具有两个2D卷积输入层,其后是多个双向递归层和在softmax层之前的一个完全连接层的递归神经网络。在一个或多个实施例中,使用CTC损失功能对网络进行端到端训练,以直接从音频的对数频谱图预测字符序列。通过优化的递归层和训练以及对齐信息,可以在部署的模型中消除由于仅使用正向递归导致的一些不希望的偏差。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号