The Use of Overlapped Sub-Bands in Multi-Band, Multi-SNR, Multi-Path Recognition of Noisy Word Utterances

Yutaka TSUBOI; Takehiro IHARA; Kazuyuki TAKAGI; Kazuhiko OZEKI

首页> 外文期刊>IEICE Transactions on Information and Systems >The Use of Overlapped Sub-Bands in Multi-Band, Multi-SNR, Multi-Path Recognition of Noisy Word Utterances

【24h】

The Use of Overlapped Sub-Bands in Multi-Band, Multi-SNR, Multi-Path Recognition of Noisy Word Utterances

机译：重叠的子频带在多频带，多SNR，多路径的有声单词话语识别中的使用

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A solution to the problem of improving robustness to noise in automatic speech recognition is presented in the framework of multi-band, multi-SNR, and multi-path approaches. In our word recognizer, the whole frequency band is divided into seven-overlapped sub-bands, and then sub-band noisy phoneme HMMs are trained on speech data mixed with the filtered white Gaussian noise at multiple SNRs. The acoustic model of a word is built as a set of concatenations of clean and noisy sub-band phoneme HMMs arranged in parallel. A Viterbi decoder allows a search path to transit to another SNR condition at a phoneme boundary. The recognition scores of the sub-bands are then recombined to give the score for a word. Experiments show that the overlapped seven-band system yields the best performance under nonstationary ambient noises. It is also shown that the use of filtered white Gaussian noise is advantageous for training noisy phoneme HMMs.

机译：在多频带，多SNR和多路径方法的框架下，提出了一种解决方案，该方案在自动语音识别中提高了对噪声的鲁棒性。在我们的单词识别器中，将整个频带划分为七个重叠的子带，然后在混合有多个SNR的滤波后的高斯白噪声的语音数据上训练子带噪声音素HMM。单词的声学模型构建为一组并行排列的干净且嘈杂的子带音素HMM。维特比解码器允许搜索路径转换到音素边界处的另一个SNR条件。然后将子带的识别分数重新组合以给出单词的分数。实验表明，重叠的七波段系统在非平稳环境噪声下具有最佳性能。还表明，使用滤波后的高斯白噪声对训练有噪音素HMM有利。

著录项

来源
《IEICE Transactions on Information and Systems》 |2008年第6期|p.1774-1782|共9页
作者
Yutaka TSUBOI; Takehiro IHARA; Kazuyuki TAKAGI; Kazuhiko OZEKI;
展开▼
作者单位

University of Electro-Communications, Chofu-shi, 182-8585 Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
multi-band; multi-SNR; multi-path; overlapped sub-bands; noisy word recognition; ambient noise;

机译：多频带;多SNR;多径;子频带重叠;噪声词识别;环境噪声;
入库时间 2022-08-18 00:27:43

相似文献

外文文献
中文文献
专利

1. Rapid Environment Adaptation Method Based on HMM Composition with Prior Noise GMM and Multi-SNR Models for Noisy Speech Recognition [J] . Masaki Ida, Satoshi Nakamura Electronics and Communications in Japan. Part 2, Electronics . 2004,第6期

机译：基于先验噪声GMM和Multi-SNR模型的HMM组合的快速环境自适应方法
2. Rapid model adaptation with a prior noise GMM and multi-SNR models for noisy speech recognition [J] . Masaki Ida, Satoshi Nakamura 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2001,第520期

机译：利用先前的噪声GMM和多SNR模型对噪声语音识别进行快速模型自适应
3. Rapid model adaptation with a prior noise GMM and multi-SNR models for noisy speech recognition [J] . Masaki Ida, Satoshi Nakamura 電子情報通信学会技術研究報告. 音声. Speech . 2001,第522期

机译：利用先前的噪声GMM和多SNR模型对噪声语音识别进行快速模型自适应
4. Optimization of Sub-Band weights Using Simulated Noisy Speech In Multi-Band Speech Recognition [C] . Yik-Cheung Tam, Brian Mak 6th International Conference on Spoken Language Processing ICSLP 2000 Oct.16-Oct.20 2000 Beijing International Convention Center, Beijing, China . 2000

机译：模拟噪声在多频带语音识别中的子频带权重优化
5. Trends in mean length of utterance before words and grammar [D] . Fagan, Mary K. 2005

机译：单词和语法之前的平均发音长度趋势
6. The bounds estimate of sub-band operators for multi-band wavelets [O] . Qingyun Zou, Guoqiu Wang, Qian Cao -1

机译：多带小波子带算子的边界估计
7. Out-of-Task Utterance Detection Based on Bag-of-Words Using Automatic Speech Recognition Results [O] . Fujita Yoko, Takeuchi Shota, Kawanami Hiromichi, 2011

机译：自动语音识别结果基于词袋的任务外话语检测

The Use of Overlapped Sub-Bands in Multi-Band, Multi-SNR, Multi-Path Recognition of Noisy Word Utterances

摘要

著录项

相似文献

相关主题

期刊订阅