首页> 外国专利> DILATED CONVOLUTIONS AND GATING FOR EFFICIENT KEYWORD SPOTTING

DILATED CONVOLUTIONS AND GATING FOR EFFICIENT KEYWORD SPOTTING

机译:扩张卷积和门控,实现高效的关键字发现

摘要

A method for detection of a keyword in a continuous stream of audio signal, by using a dilated convolutional neural network (DCNN), implemented by one or more computers embedded on a device, the dilated convolutional network (DCNN) comprising a plurality of dilation layers (DL), including an input layer (IL) and an output layer (OL), each layer of the plurality of dilation layers (DL) comprising gated activation units, and skip-connections to the output layer (OL), the dilated convolutional network (DCNN) being configured to generate an output detection signal when a predetermined keyword is present in the continuous stream of audio signal, the generation of the output detection signal being based on a sequence (SSM) of successive measurements (SM) provided to the input layer (IL), each successive measurement (SM) of the sequence (SSM) being measured on a corresponding frame from a sequence of successive frames extracted from the continuous stream of audio signal, at a plurality of successive time steps.
机译:一种通过使用扩张卷积神经网络(DCNN)来检测连续音频信号流中的关键字的方法,该扩张卷积神经网络(DCNN)由嵌入在设备上的一台或多台计算机实现,该扩张卷积网络(DCNN)包括多个扩张层(DL),包括输入层(IL)和输出层(OL),该扩张层的每一层(DL)包括门控激活单元, 以及与输出层(OL)的跳跃连接,膨胀卷积网络(DCNN)被配置为在音频信号的连续流中存在预定关键字时生成输出检测信号,输出检测信号的生成基于提供给输入层(IL)的连续测量(SM)的序列(SSM), 序列(SSM)的每个连续测量(SM)从从音频信号的连续流中提取的连续帧序列的相应帧上测量,以多个连续的时间步长。

著录项

  • 公开/公告号US20220277736A1;US2022000277736A1;US2022277736A1;US2022277736

    专利类型

  • 公开/公告日2022-09-01

    原文格式PDF

  • 申请/专利权人 SONOS VOX FRANCE SAS;

    申请/专利号US17549253;US202100017549253;US202117549253A;US202117549253

  • 发明设计人

    申请日2021-12-13

  • 分类号G10L15/16;G10L15/06;G10L15/22;

  • 国家

  • 入库时间 2024-06-14 23:38:23

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号