首页> 外国专利> METHOD AND SYSTEM FOR GENERATING ADVANCED FEATURE DISCRIMINATION VECTORS FOR USE IN SPEECH RECOGNITION

METHOD AND SYSTEM FOR GENERATING ADVANCED FEATURE DISCRIMINATION VECTORS FOR USE IN SPEECH RECOGNITION

机译：生成用于语音识别的高级特征识别向量的方法和系统

页面导航

摘要
著录项
相似文献

摘要

A method of renormalizing high-resolution oscillator peaks, extracted from windowed samples of an audio signal, is disclosed. Feature vectors are generated for which variations in both fundamental frequency and time duration of speech are substantially mitigated. The feature vectors may be aligned within a common coordinate space, free of those variations in frequency and time duration that occurs between speakers, and even over speech by a single speaker, to facilitate a simple and accurate determination of matches between those AFDVs generated from a sample of the audio signal and corpus AFDVs generated for known speech at the phoneme and sub-phoneme level. The renormalized feature vectors can be combined with traditional feature vectors such as MFCCs, or they can be used exclusively to identify voiced, semi-voiced and unvoiced sounds.

机译：公开了一种重新标准化从音频信号的窗口采样中提取的高分辨率振荡器峰值的方法。生成特征向量，对于这些特征向量，基本频率和语音持续时间的变化都得到了缓解。特征向量可以在公共坐标空间内对齐，没有出现在说话者之间的频率和持续时间的变化，甚至没有单个说话者在语音上发生的变化，以便于简单，准确地确定从一个扬声器生成的那些AFDV之间的匹配。在音素和子音素级别为已知语音生成的音频信号和语料库AFDV的样本。重新归一化的特征向量可以与诸如MFCC之类的传统特征向量组合，或者它们可以专门用于识别浊音，半浊音和非浊音。

著录项

公开/公告号US2020160839A1

专利类型
公开/公告日2020-05-21

原文格式PDF
申请/专利权人 XMOS INC.;
展开▼

申请/专利号US201916520104
发明设计人 KEVIN M. SHORT;BRIAN HONE;
展开▼

申请日2019-07-23
分类号G10L15/02;G10L25/21;G10L25/18;G10L25/24;G10L25/03;
国家 US
入库时间 2022-08-21 11:22:54

相似文献

专利
外文文献
中文文献