Easy does it: Robust spectro-temporal many-stream ASR without fine tuning streams

机译：简单易行：强大的频谱时间多流ASR，无需微调流

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Previous work has shown that spectro-temporal features reduce the word error rate for automatic speech recognition under noisy conditions. These systems, however, required significant hand-tuning in order to determine which spectral and temporal modulations should be included in a particular stream. In this work, streams are split into one spectral and temporal modulation each and their posterior probabilities are combined once each stream is discriminatively trained via multilayer perceptron. We show that this combination structure performs as well or better than more elaborate methods in which multiple spectral and temporal modulations are hand-picked per stream. In addition, these type of features outperform standard noise-robust features such as the “Advanced Front End” features, whereas our hand-picked spectro-temporal features do not.

机译：先前的工作表明，时空特征可以降低嘈杂条件下自动语音识别的单词错误率。但是，这些系统需要进行大量的手动调整，才能确定特定流中应包含哪些频谱和时间调制。在这项工作中，将流分别分成一个频谱和时间调制，一旦通过多层感知器对每个流进行判别式训练，它们的后验概率就会合并在一起。我们表明，这种组合结构的性能比精巧的方法好或更好，在精巧的方法中，每个流均会手动选择多个频谱和时间调制。此外，这些类型的功能要优于标准的抗噪功能，例如“高级前端”功能，而我们手工挑选的光谱时功能则不然。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP》|2012年|p.4309- 4312|共4页
会议地点 Kyoto(JP)
作者
Ravuri, Suman V.;
展开▼
作者单位

International Computer Science Institute Berkeley CA 94704 USA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Spectro-temporal Power Spectrum Features for Noise Robust ASR [J] . Seresht Hamed Riazati, Ahadi Seyed Mohammad, Seyedin Sanaz Circuits, systems, and signal processing . 2017,第8期

机译：噪声鲁棒ASR的频谱时功率谱特性
2. Robust cortical entrainment to the speech envelope relies on the spectro-temporal fine structure [J] . DingN., ChatterjeeM., SimonJ.Z. NeuroImage . 2014,第Null期

机译：语音包络的牢固皮层夹带依赖于光谱时的精细结构
3. Robust cortical entrainment to the speech envelope relies on the spectro-temporal fine structure [J] . DingN., ChatterjeeM., SimonJ.Z. NeuroImage . 2014,第Null期

机译：对语音信封的强大皮质夹带依赖于光谱 - 时间精细结构
4. Easy does it: Robust spectro-temporal many-stream ASR without fine tuning streams [C] . Ravuri Suman V. IEEE International Conference on Acoustics, Speech and Signal Processing . 2011

机译：易于做到：强大的光谱 - 时间多流ASR，没有微调流
5. Noise-robust spectro-temporal acoustic signature recognition using nonlinear Hebbian learning [D] . Lu, Bing 2009

机译：非线性Hebbian学习的鲁棒光谱时空声学特征识别
6. Robust Cortical Entrainment to the Speech Envelope Relies on the Spectro-temporal Fine Structure [O] . Nai Ding, Monita Chatterjee, Jonathan Z. Simon -1

机译：语音包络的鲁棒皮层夹带依赖于时空精细结构
7. The Joint Optimization of Spectro-Temporal Features and Deep Neural Nets for Robust ASR [O] . Kovács György, Tóth László 2014

机译：鲁棒ASR的光谱时态特征和深层神经网络的联合优化

Easy does it: Robust spectro-temporal many-stream ASR without fine tuning streams

摘要

著录项

相似文献

相关主题

期刊订阅