Improvements of a dual-input DBN for noise robust ASR

机译：双输入DBN的改进以增强抗噪ASR

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In previous work we have shown that an ASR system consisting of a dual-input Dynamic Bayesian Network (DBN) which simultaneously observes MFCC acoustic features and an exemplar-based Sparse Classification (SC) phoneme predictor stream can achieve better word recognition accuracies in noise than a system that observes only one input stream. This paper explores three modifications of SC input to further improve the noise robustness of the dual-input DBN system: 1) using state likelihoods instead of phonemes, 2) integrating more contextual information and 3) using a complete set of likelihood distribution. Experiments on aurora-2 reveal that the combination of the first two approaches significantly improves the recognition results, achieving up to 29% (absolute) accuracy gain at SNR -5 dB. In the dual-input system using the full likelihood vector does not outperform using the best state prediction.

机译：在先前的工作中，我们表明，由双输入动态贝叶斯网络（DBN）同时观察MFCC声学特征和基于示例的稀疏分类（SC）音素预测器流组成的ASR系统，在噪声方面比在单词识别方面的准确性要高。一种仅观察一个输入流的系统。本文探讨了SC输入的三种修改，以进一步提高双输入DBN系统的噪声鲁棒性：1）使用状态似然代替音素，2）集成更多上下文信息，以及3）使用完整的似然分布集。在aurora-2上进行的实验表明，前两种方法的组合可显着改善识别结果，在SNR -5 dB时可获得高达29％（绝对）的准确度增益。在使用全似然向量的双输入系统中，使用最佳状态预测不会胜过任何情况。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2011》|2011年|p.1680-1683|共4页
会议地点
作者
Yang Sun; Jort F. Gemmeke; Bert Cranen; Louis ten Bosch; Lou Boves;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
ASR; noise robustness; sparse classification; dual-input DBN;

机译：ASR;噪声鲁棒性;稀疏分类双输入DBN;
入库时间 2022-08-26 15:06:02

相似文献

外文文献
中文文献
专利

1. A Pitch-Synchronous Peak-Amplitude Based Feature Extraction Method with Noise Reduction, Modulation Enhancement, and Masking for Noise-Robust ASR [J] . Muhammad GHULAM, Junsei HORIKAWA, Tsuneo NITTA 電子情報通信学会技術研究報告. 音声. Speech . 2005,第496期

机译：一种基于变桨同步峰值幅度的降噪，调制增强和掩蔽的鲁棒ASR特征提取方法
2. A Pitch-Synchronous Peak-Amplitude Based Feature Extraction Method with Noise Reduction, Modulation Enhancement, and Masking for Noise-Robust ASR [J] . Muhammad GHULAM, Junsei HORIKAWA, Tsuneo NITTA, 電子情報通信学会技術研究報告. 音声. Speech . 2005,第496期

机译：一种基于变桨同步峰值幅度的降噪，调制增强和掩蔽的鲁棒ASR特征提取方法
3. A Pitch-Synchronous Peak-Amplitude Based Feature Extraction Method with Noise Reduction, Modulation Enhancement, and Masking for Noise-Robust ASR [J] . Muhammad GHULAM, Junsei HORIKAWA, Tsuneo NITTA 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2005,第494期

机译：一种基于变桨同步峰值幅度的降噪，调制增强和掩蔽的鲁棒ASR特征提取方法
4. Learning robust features from underwater ship-radiated noise with mutual information group sparse DBN [C] . Sheng SHEN, Honghui YANG, Zhen HAN, International Congress and Exposition on Noise Control Engineering . 2016

机译：学习来自水下船辐射噪声的强大功能，具有相互信息组稀疏DBN
5. Robust Networks: Neural Networks Robust to Quantization Noise and Analog Computation Noise Based on Natural Gradient [D] . Kadambi, Pradyumna . 2019

机译：强大的网络：神经网络基于自然梯度的量化噪声和模拟计算噪声鲁棒
6. Correction: Accuracy Maximization Analysis for Sensory-Perceptual Tasks: Computational Improvements, Filter Robustness, and Coding Advantages for Scaled Additive Noise [O] . 2018

机译：校正：感官任务的精度最大化分析：计算上的改进，滤波器的鲁棒性和成比例增加噪声的编码优势
7. Improvements of a dual-input DBN for noise robust ASR [O] . Sun Yang, Gemmeke Jort, Cranen Bert, 2011

机译：双输入DBN的改进以增强抗噪ASR

Improvements of a dual-input DBN for noise robust ASR

摘要

著录项

相似文献

相关主题

期刊订阅