Implementing a high accuracy continuous speech recognizer on a fixed-point processor

Abstract

A small-vocabulary speech recognizer suitable for implementation on a 16-bit fixed-point DSP is described. The input speech x_t is sampled at analog-to-digital (A/D) converter 11, and the digital samples are applied to MFCC (Mel-scaled cepstrum coefficients) front-end processing 13. For robustness to background noise, PMC (parallel model combination) 15 is integrated. The MFCC and Gaussian mean vectors are applied to PMC 15. The MFCC and PMC provide speech features extracted in noise, and these are used to modify the HMMs. The noise-adapted HMMs, excluding mean vectors, are applied to the search procedure to recognize the grammar. A method of computing MFCC comprises the steps of: performing dynamic Q-point computation for the preemphasis, Hamming window, FFT, complex-FFT-to-power-spectrum, and Mel-scale power-spectrum-into-filter-bank steps; performing a log filter bank step; and, after the log filter bank step, performing fixed Q-point computation. A polynomial fit is used to compute log2 in the log filter bank step. The method of computing PMC comprises the steps of: computing a noise MFCC profile; applying a cosine transform to convert the MFCC into the Mel-scale filter bank domain; converting the log filter bank into a linear filter bank with an exponential, where a polynomial fit is used to compute exp2; performing model combination in the linear filter bank domain; and converting the noise-compensated linear filter bank into MFCC by a log and an inverse cosine transform.
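The abstract mentions polynomial fits for log2 and exp2 and a model combination carried out in the linear filter bank domain. The following is a minimal sketch of how those pieces could fit together using integer arithmetic only. It is not the patent's implementation: the Q12 format, the quadratic fit coefficients `C_LOG` and `C_EXP`, and the function names `log2_fixed`, `exp2_fixed`, and `combine_log_energies` are all assumptions chosen for illustration, and the simple sum of speech and noise energies stands in for whatever combination rule the patent actually uses.

```python
import math

FRAC_BITS = 12          # assumed Q-point: 12 fractional bits (Q12)
ONE = 1 << FRAC_BITS
C_LOG = 1411            # about 0.3445 in Q12, illustrative coefficient for the log2 fit
C_EXP = 1407            # about 0.3435 in Q12, illustrative coefficient for the exp2 fit


def _bump(f: int) -> int:
    """Return f * (1 - f) in Q12 for a Q12 fraction f in [0, 1)."""
    return (f * (ONE - f)) >> FRAC_BITS


def log2_fixed(x: int, q_in: int = 0) -> int:
    """Approximate log2 of a positive value stored with q_in fractional bits (result in Q12)."""
    if x <= 0:
        raise ValueError("log2 is only defined for positive inputs")
    exp = x.bit_length() - 1                       # position of the leading 1 bit
    # Normalise so the mantissa lies in [1, 2) as a Q12 number.
    m = x >> (exp - FRAC_BITS) if exp >= FRAC_BITS else x << (FRAC_BITS - exp)
    f = m - ONE                                    # fractional part of the mantissa
    frac = f + ((C_LOG * _bump(f)) >> FRAC_BITS)   # log2(1 + f) ~ f + c*f*(1 - f)
    return ((exp - q_in) << FRAC_BITS) + frac


def exp2_fixed(y: int, q_out: int = 0) -> int:
    """Approximate 2**y for a Q12 input y (result has q_out fractional bits)."""
    ip = y >> FRAC_BITS                            # floor of y (arithmetic shift)
    f = y & (ONE - 1)                              # fractional part in [0, 1), Q12
    mant = ONE + f - ((C_EXP * _bump(f)) >> FRAC_BITS)  # 2**f ~ 1 + f - c*f*(1 - f)
    shift = ip + q_out - FRAC_BITS
    return mant << shift if shift >= 0 else mant >> -shift


def combine_log_energies(speech_log: int, noise_log: int, q_lin: int = 8) -> int:
    """Add speech and noise energies in the linear domain; inputs and output are Q12 log2 values."""
    lin = exp2_fixed(speech_log, q_lin) + exp2_fixed(noise_log, q_lin)
    return log2_fixed(lin, q_lin)


if __name__ == "__main__":
    speech, noise = 10 * ONE, 8 * ONE              # log2 energies 10.0 and 8.0 in Q12
    combined = combine_log_energies(speech, noise)
    print(combined / ONE, math.log2(2.0 ** 10 + 2.0 ** 8))
```

With these quadratic fits the log2 error is on the order of 0.01 and the exp2 relative error is a few tenths of a percent; a higher-order polynomial or a different Q-point would trade cycles for accuracy. On a real 16-bit DSP the intermediate products would also need explicit width management, which Python's unbounded integers hide.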

Bibliographic details

Publication number: US2002198706A1

Publication date: 2002-12-26

Original format: PDF

Applicant/patentee: KAO YU-HUNG; GONG YIFAN

Application number: US20020136967

Inventors: YU-HUNG KAO; YIFAN GONG

Filing date: 2002-05-02

Classification: G10L21/00

Country: US

Date indexed: 2022-08-22 00:08:48

Similar documents

Patents

1. Method for implementing a linear discrete-time state-space servo controller on a fixed-point digital signal processor of a disk drive [P]. Chinese patent: CN1266526A. 2000-09-13

2. Configuration method for implementing high-precision division on a general-purpose reconfigurable processor [P]. Chinese patent: CN105302769A. 2016-02-03

3. Implementing a high accuracy continuous speech recognizer on a fixed-point processor [P]. US patent: US7103547B2. 2006-09-05

4. CONTINUOUS WORD RECOGNITION SYSTEM FOR SPEECH RECOGNIZING DEVICE AND RECORDING MEDIUM STORED WITH CONTINUOUS WORD RECOGNITION PROCESSING PROGRAM IN SPEECH RECOGNIZING DEVICE [P]. Japanese patent: JP2000089788A. 2000-03-31

5. CONTINUOUS WORD RECOGNITION SYSTEM FOR SPEECH RECOGNIZING DEVICE AND RECORDING MEDIUM STORED WITH CONTINUOUS WORD RECOGNITION PROCESSING PROGRAM IN SPEECH RECOGNIZING DEVICE [P]. Japanese patent: JP2000089787A. 2000-03-31

Foreign-language literature

1. Improved Real-time Implementation of Adaptive Gaussian Mixture Model-based Object Detection Algorithm for Fixed-point DSP Processors [J]. Byung-eun LEE, Thanh-binh NGUYEN, Sun-tae CHUNG. Measurement Science and Instrumentation (English edition). 2010, No. 2

2. AN ANALYSIS OF ACOUSTIC CHARACTERISTICS OF CLEFT PALATE SPEECH WITH COMPUTERIZED SPEECH SIGNAL PROCESSING SYSTEM [J]. 李锦峰, 刘建华. Journal of Pharmaceutical Analysis (English edition). 1996, No. 2

3. Implementing a high accuracy speaker-independent continuous speech recognizer on a fixed-point DSP [C]. Yifan Gong, Yu-Hung Kao. 2000

4. Automating transformations from floating-point to fixed-point for implementing digital signal processing algorithms [D]. Han, Kyungtae. 2006

5. The Auditory-Brainstem Response to Continuous Non-repetitive Speech Is Modulated by the Speech Envelope and Reflects Speech Processing [O]. Chagit S. Reichenbach, Chananel Braiman, Nicholas D. Schiff, 2016

6. FPGA-based Implementation of a Real-time 5000-word Continuous Speech Recognizer [O]. Choi Young-kyu, Sung Wonyong, You Kisun. 2008

Chinese-language literature

1. Improved Real-time Implementation of Adaptive Gaussian Mixture Model-based Object Detection Algorithm for Fixed-point DSP Processors [J]. Byung-eun LEE, Thanh-binh NGUYEN, Sun-tae CHUNG. Measurement Science and Instrumentation. 2010, No. 2

2. A Multi-Band Speech Enhancement Algorithm Exploiting Iterative Processing for Enhancement of Single Channel Speech [J]. Navneet Upadhyay, Abhijit Karmakar. Signal and Information Processing (English). 2013, No. 2

3. Fixed-point Implementation of Approximate Message Passing (AMP) algorithm in massive MIMO systems [J]. Mhlaliseni Khumalo, Wan-Ting Shi, Chao-Kai Wen. Digital User: Digital Communications. 2016, No. 4

4. Design and implementation of a floating-point FFT algorithm on a fixed-point processor [C]. 黄琳琳, 刘仲. 18th Annual Conference on Computer Engineering and Technology and 4th Microprocessor Technology Forum. 2014

5. Design and implementation of a 16-bit fixed-point embedded digital signal processor based on an efficient pipeline [A]. 曾晓文. 2005