Unified Endpointer Using Multitask and Multidomain Learning

Abstract

A method for training an endpointer model includes obtaining short-form speech utterances and long-form speech utterances. The method also includes providing a short-form speech utterance as input to a shared neural network, the shared neural network configured to learn shared hidden representations suitable for both voice activity detection (VAD) and end-of-query (EOQ) detection. The method also includes generating, using a VAD classifier, a sequence of predicted VAD labels and determining a VAD loss by comparing the sequence of predicted VAD labels to a corresponding sequence of reference VAD labels. The method also includes generating, using an EOQ classifier, a sequence of predicted EOQ labels and determining an EOQ loss by comparing the sequence of predicted EOQ labels to a corresponding sequence of reference EOQ labels. The method also includes training, using a cross-entropy criterion, the endpointer model based on the VAD loss and the EOQ loss.
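The loss computation described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the patented implementation: the single tanh layer standing in for the shared neural network, the layer sizes, the binary label sets (VAD: 0 = non-speech, 1 = speech; EOQ: 0 = query ongoing, 1 = query ended), the equal weighting of the two losses, and every function name are assumptions made for illustration. Only the forward pass and the combined cross-entropy loss are shown; the parameter update is omitted.

```python
import math
import random


def softmax(logits):
    """Turn a list of logits into a probability distribution."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def linear(layer, x):
    """Apply an affine layer given as a (weights, bias) pair."""
    weights, bias = layer
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def init_layer(rng, n_out, n_in):
    """Hypothetical initializer: small random weights, zero bias."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def cross_entropy(predicted, reference):
    """Mean per-frame cross-entropy against reference label indices."""
    return -sum(math.log(p[y]) for p, y in zip(predicted, reference)) / len(reference)


def endpointer_losses(frames, vad_ref, eoq_ref, params):
    """Return (VAD loss, EOQ loss, combined loss) for one utterance."""
    shared, vad_head, eoq_head = params
    vad_pred, eoq_pred = [], []
    for x in frames:
        # Shared hidden representation consumed by both classifiers.
        hidden = [math.tanh(v) for v in linear(shared, x)]
        vad_pred.append(softmax(linear(vad_head, hidden)))
        eoq_pred.append(softmax(linear(eoq_head, hidden)))
    vad_loss = cross_entropy(vad_pred, vad_ref)
    eoq_loss = cross_entropy(eoq_pred, eoq_ref)
    # Assumption: the two task losses are summed with equal weight.
    return vad_loss, eoq_loss, vad_loss + eoq_loss


# Toy short-form utterance: 6 frames of 4 made-up acoustic features.
rng = random.Random(0)
FEATURE_DIM, HIDDEN_DIM, NUM_CLASSES = 4, 8, 2
params = (
    init_layer(rng, HIDDEN_DIM, FEATURE_DIM),   # shared network
    init_layer(rng, NUM_CLASSES, HIDDEN_DIM),   # VAD classifier
    init_layer(rng, NUM_CLASSES, HIDDEN_DIM),   # EOQ classifier
)
frames = [[rng.gauss(0.0, 1.0) for _ in range(FEATURE_DIM)] for _ in range(6)]
vad_ref = [0, 1, 1, 1, 0, 0]   # reference VAD labels, one per frame
eoq_ref = [0, 0, 0, 0, 1, 1]   # reference EOQ labels, one per frame
vad_loss, eoq_loss, total_loss = endpointer_losses(frames, vad_ref, eoq_ref, params)
```

In a full training loop, the gradient of `total_loss` with respect to all three parameter groups would be used to update them, so that the shared layer receives a training signal from both tasks.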

Bibliographic details

Publication number: US2020117996A1
Publication date: 2020-04-16
Original format: PDF
Applicant/Assignee: GOOGLE LLC
Application number: US201916711172
Inventors: SHUO-YIIN CHANG; BO LI; GABOR SIMKO; MARIA CAROLINA PARADA SAN MARTIN; SEAN MATTHEW SHANNON
Filing date: 2019-12-11
Classification: G06N3/08; G06N3/04; G06N5/04; G06N20/20; G06K9/62; G10L15/16
Country: US
Date added to database: 2022-08-21 11:25:03
