首页> 外国专利> A feature extraction system, an automatic speech recognition system, a feature extraction method, an automatic speech recognition method and a method of train

A feature extraction system, an automatic speech recognition system, a feature extraction method, an automatic speech recognition method and a method of train

机译:特征提取系统,自动语音识别系统,特征提取方法,自动语音识别方法和火车方法

摘要

An automatic speech recognition system, comprising: an input for receiving a speech signal; and a processor configured to: filter an input speech signal using a filter bank comprising a plurality of filters, wherein each filter in the filter bank modifies the input speech signal in the time domain by a different frequency dependent gain, the filter bank outputting a time domain signal from each filter; extract a temporal envelope from the output time domain signal from each filter in the filter bank; frame the temporal envelopes; extract a feature vector for each frame, wherein each feature vector comprises a feature coefficient extracted from the frame of the temporal envelope of the output time domain signal from each filter in the filter bank; input the feature vectors into a deep neural network based classifier, the classifier generating one or more automatic speech recognition hypotheses corresponding to the input speech signal. The filter bank is a Gammatone filter bank. Extracting the temporal envelope from the output time domain signal comprises full wave rectifying the output time domain signal from each filter in the filter bank and low pass filtering each of the rectified signals. A time delay neural network (TDNN) de-noising auto encoder is used.
机译:一种自动语音识别系统,包括:用于接收语音信号的输入;处理器,其被配置为:使用包括多个滤波器的滤波器组来滤波输入语音信号,其中滤波器组中的每个滤波器在时域中通过不同的频率相关增益来修改输入语音信号,滤波器组输出时间来自每个滤波器的域信号;从滤波器组中每个滤波器的输出时域信号中提取时间包络;构图时间包络;为每个帧提取特征向量,其中每个特征向量包括从滤波器组中每个滤波器的输出时域信号的时间包络的帧中提取的特征系数;将特征向量输入到基于深度神经网络的分类器中,分类器生成对应于输入语音信号的一个或多个自动语音识别假设。滤波器组是Gammatone滤波器组。从输出时域信号中提取时间包络包括对来自滤波器组中的每个滤波器的输出时域信号进行全波整流,并对每个整流后的信号进行低通滤波。使用了时延神经网络(TDNN)去噪自动编码器。

著录项

  • 公开/公告号GB2560174A

    专利类型

  • 公开/公告日2018-09-05

    原文格式PDF

  • 申请/专利权人 KABUSHIKI KAISHA TOSHIBA;

    申请/专利号GB20170003310

  • 发明设计人 CONG THANH DO;IOANNIS STYLIANOU;

    申请日2017-03-01

  • 分类号G10L15/20;G10L21/02;

  • 国家 GB

  • 入库时间 2022-08-21 12:32:04

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号