首页> 外文会议>IEEE Region 10 Conference >Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA

【24h】

Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA

机译：改进了具有FPGA的光谱减法和最小统计算法的噪声鲁棒自动语音识别系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this study, spectral subtraction speech enhancement is integrated to a two word vocabulary speech recognition system to effectively reduce the effects of background noise and increase the recognition rate. The whole system was implemented in FPGA and was modelled in MATLAB. The preprocessing subsystem contains the spectral subtraction algorithm and acoustic front end speech enhancements while the speech recognition subsystem contains the HMM and Viterbi search algorithms. 10 dirty speech samples of word ‘stop’ and ‘clockwise’ (sampled at 84 dB) were tested in the speech recognition prototype with varying background noise from 44.6 to 85.4 dB and noise floor (β) from 0.01 to 1. At the end of the testing, the system was able to recognize the two words (stop and clockwise) efficiently with accuracy rate of above 80% until a background noise of 68.6 dB. The best average recognition rate (from 44.6 to 85.4 dB background noise) of 48.5% on the other hand was recorded at 0.01 noise floor. The system without spectral subtraction enhancement was noticed to function efficiently only at 56.6 dB.

机译：在该研究中，光谱减法语音增强被集成到两个词汇表语音识别系统，以有效地降低背景噪声的影响并提高识别率。整个系统是在FPGA实施的，并在Matlab中进行了建模。预处理子系统包含光谱减法算法和声学前端语音增强，而语音识别子系统包含HMM和Viterbi搜索算法。在语音识别原型中测试了10个“停止”和“顺时针”（在84dB上采样）的脏话样本，从44.6到85.4 dB和噪声地板（β）之间的变化从44.6到1的噪声（β）。测试，系统能够以高于80％的精度率（直到68.6 dB的背景噪声为高于80％的精度才能识别两个单词（停止和顺时针）。另一方面，另一方面，最佳平均识别率（从44.6至85.4 dB背景噪声）以0.01噪声录制。没有谱减法增强的系统被注意到仅在56.6dB处有效起作用。

著录项

来源
《IEEE Region 10 Conference》|2012年||共6页
会议地点
作者
Orillo John William; Yap Roderick; Sybingco Edwin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
Spectral Subtraction; Speech Recognition; Viterbi search;

机译：光谱减法;语音识别;维特比搜索;

相似文献

外文文献
中文文献
专利

1. Robust speech recognition by using spectral subtraction with noise peak shifting [J] . Dai P., Soon I.Y. Signal Processing, IET . 2013,第8期

机译：通过使用谱峰相减和噪声峰值偏移来实现可靠的语音识别
2. Improving performance of spectral subtraction in speech recognition using a model for additive noise [J] . Yoma N.B., McInnes F.R. IEEE Transactions on Speech and Audio Proceeding . 1998,第6期

机译：使用加性噪声模型提高语音识别中频谱减法的性能
3. Improved noise minimum statistics estimation algorithm for using in a speech-passing noise-rejecting headset [J] . Seyedtabaee S., Moazami Goodarzi H. EURASIP journal on advances in signal processing . 2010,第14期

机译：用于语音传递降噪耳机的改进的噪声最小统计估计算法
4. Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA [C] . Orillo John William, Yap Roderick, Sybingco Edwin 2012 IEEE Region 10 Conference: sustainable development through humanitarian technology. . 2012

机译：改进的具有频谱减法和最小统计算法的噪声鲁棒自动语音识别系统，在FPGA中实现
5. Compressive nonlinearity for representing speech spectral magnitude to improve noise robustness of automatic speech recognition . [D] . Wong, Brian. 2011

机译：压缩非线性表示语音频谱幅度提高语音自动识别的鲁棒性。
6. Spectral and Temporal Envelope Cues for Human and Automatic Speech Recognition in Noise [O] . Guangxin Hu, Sarah C. Determan, Yue Dong, 2020

机译：用于噪声中的人类和自动语音识别的光谱和颞包络线
7. Robust Automatic Speech recognition System Implemented in a Hybrid Design DSP-FPGA [O] . Ali Aldahoud, Hamza Atoui, Mohamed Fezari 2014

机译：在混合设计Dsp-FpGa中实现的鲁棒自动语音识别系统

Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA

摘要

著录项

相似文献

相关主题

期刊订阅