SYSTEM AND METHOD FOR ON-DEVICE OPEN-VOCABULARY KEYWORD SPOTTING

Abstract

A device (1) comprising a keyword encoder (2), an acoustic encoder (3), and a keyword detector (4);
- wherein the keyword encoder (2) is a first neural network configured to determine, for each keyword of a set of keywords (abc, bac, cba, cab), at least one parameter, the at least one parameter being determined based on a keyword representation (7), said keyword representation (7) being provided as input to the keyword encoder (2), the keyword encoder (2) being configured to communicate the at least one parameter to the keyword detector (4), for each keyword of the set of keywords (abc, bac, cba, cab);
- wherein the acoustic encoder (3) consists in a second neural network trained for automatic speech recognition, the acoustic encoder (3) being configured to take as input a set of feature vectors, each feature vector of the set of feature vectors comprising at least one feature characterizing an acoustic signal (5), the acoustic encoder (3) being further configured to provide as output a set of intermediate feature vectors (6), each intermediate feature vector of the set of intermediate feature vectors (6) comprising at least one feature, and being input to the keyword detector (4);
- wherein the keyword detector (4) is a third neural network configured, by the at least one parameter, to predict, for each keyword in the set of keywords (abc, bac, cba, cab), a probability of detection of said keyword in the acoustic signal (5), the probability of detection (P1, P2, P3, P4) of said keyword being based on the set of intermediate feature vectors (6) provided by the acoustic encoder (3).
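
The data flow described in the abstract can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration and is not taken from the patent: the function names, the dimensions, the position-weighted character encoding of the keyword representation, the single linear layers standing in for the three neural networks, the max over frames, and the sigmoid output. The weights are random and untrained, so the resulting probabilities carry no meaning; the sketch only shows how keyword-specific parameters produced by the keyword encoder configure the detector that scores the acoustic encoder's intermediate feature vectors.

```python
import math
import random

random.seed(0)  # random, untrained weights: for illustration only

FEAT_DIM = 4     # assumed size of each input acoustic feature vector
HIDDEN_DIM = 3   # assumed size of each intermediate feature vector
ALPHABET = "abc" # assumed symbol set for keyword representations


def rand_matrix(rows, cols):
    """Return a rows x cols matrix of random weights in [-1, 1]."""
    return [[random.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


# Placeholder single-layer "networks" (hypothetical stand-ins).
W_ACOUSTIC = rand_matrix(HIDDEN_DIM, FEAT_DIM)
W_KEYWORD = rand_matrix(HIDDEN_DIM + 1, len(ALPHABET))


def acoustic_encoder(feature_vectors):
    """Map each input feature vector to an intermediate feature vector."""
    return [[math.tanh(h) for h in matvec(W_ACOUSTIC, f)]
            for f in feature_vectors]


def keyword_encoder(keyword):
    """Map a keyword representation to detector parameters (weights, bias)."""
    # Position-weighted character counts, so that "abc" and "bac" differ.
    rep = [0.0] * len(ALPHABET)
    for pos, ch in enumerate(keyword):
        rep[ALPHABET.index(ch)] += 1.0 / (pos + 1)
    params = matvec(W_KEYWORD, rep)
    return params[:-1], params[-1]


def keyword_detector(intermediate, weights, bias):
    """Score every frame with the keyword's parameters; keep the best frame."""
    scores = [sum(w * h for w, h in zip(weights, frame)) + bias
              for frame in intermediate]
    return 1.0 / (1.0 + math.exp(-max(scores)))  # sigmoid of the max score


def detect(feature_vectors, keywords):
    """Return one detection probability per keyword for one acoustic signal."""
    intermediate = acoustic_encoder(feature_vectors)
    return {kw: keyword_detector(intermediate, *keyword_encoder(kw))
            for kw in keywords}


# Usage: five random frames stand in for features of an acoustic signal.
signal = [[random.uniform(-1, 1) for _ in range(FEAT_DIM)] for _ in range(5)]
probs = detect(signal, ["abc", "bac", "cba", "cab"])
```

In this sketch the acoustic encoder runs once per signal regardless of the keyword set, and adding a keyword only requires one more call to `keyword_encoder`, which is one plausible reading of why the abstract separates the two encoders.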

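Bibliographic details

  • Publication number: WO2021094607A1
  • Patent type:
  • Publication date: 2021-05-20
  • Original format: PDF
  • Applicant/assignee: SONOS VOX FRANCE SAS
  • Application number: WO2020EP82243
  • Inventor: BLUCHE THÉODORE
  • Filing date: 2020-11-16
  • Classification: G10L15/16; G10L15/08
  • Country: EP
  • Database entry time: 2022-08-24 18:51:52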
