首页> 外国专利> Unified Endpointer Using Multitask and Multidomain Learning

Unified Endpointer Using Multitask and Multidomain Learning

机译:使用多任务和多域学习的统一端点

摘要

A method for training an endpointer model includes short-form speech utterances and long-form speech utterances. The method also includes providing a short-form speech utterance as input to a shared neural network, the shared neural network configured to learn shared hidden representations suitable for both voice activity detection (VAD) and end-of-query (EOQ) detection. The method also includes generating, using a VAD classifier, a sequence of predicted VAD labels and determining a VAD loss by comparing the sequence of predicted VAD labels to a corresponding sequence of reference VAD labels. The method also includes, generating, using an EOQ classifier, a sequence of predicted EOQ labels and determining an EOQ loss by comparing the sequence of predicted EOQ labels to a corresponding sequence of reference EOQ labels. The method also includes training, using a cross-entropy criterion, the endpointer model based on the VAD loss and the EOQ loss.
机译:用于训练终结者模型的方法包括短形式语音和长形式语音。该方法还包括提供简短的语音话语作为对共享神经网络的输入,该共享神经网络被配置为学习适用于语音活动检测(VAD)和查询结束(EOQ)检测的共享隐藏表示。该方法还包括使用VAD分类器生成预测的VAD标签的序列,以及通过将预测的VAD标签的序列与参考VAD标签的对应序列进行比较来确定VAD损失。该方法还包括:使用EOQ分类器生成预测的EOQ标记的序列,并通过将预测的EOQ标记的序列与参考EOQ标记的相应序列进行比较来确定EOQ损失。该方法还包括使用交叉熵标准训练基于VAD损失和EOQ损失的终结者模型。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号