Integration of beamforming and uncertainty-of-observation techniques for robust ASR in multi-source environments

Ramon Fernandez Astudillo; Dorothea Kolossa; Alberto Abad; Steffen Zeiler; Rahim Saeidi; Pejman Mowlaee; Joao Paulo da Silva Neto; Rainer Martin

首页> 外文期刊>Computer speech and language >Integration of beamforming and uncertainty-of-observation techniques for robust ASR in multi-source environments

【24h】

Integration of beamforming and uncertainty-of-observation techniques for robust ASR in multi-source environments

机译：集成了波束成形和观测不确定性技术，可在多源环境中实现强大的ASR

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents a new approach for increasing the robustness of multi-channel automatic speech recognition in noisy and reverberant multi-source environments. The proposed method uses uncertainty propagation techniques to dynamically compensate the speech features and the acoustic models for the observation uncertainty determined at the beamforming stage. We present and analyze two methods that allow integrating classical multi-channel signal processing approaches like delay and sum beamformers or Zelinski-type Wiener filters, with uncertainty-of-observation techniques like uncertainty decoding or modified imputation. An analysis of the results on the PASCAL-CHiME task shows that this approach consistently outperforms conventional beamformers with a minimal increase in computational complexity. The use of dynamic compensation based on observation uncertainty also outperforms conventional static adaptation with no need of adaptation data.

机译：本文提出了一种新方法，可在嘈杂和混响多源环境中提高多通道自动语音识别的鲁棒性。所提出的方法使用不确定性传播技术来动态补偿语音特征和声学模型，以用于在波束形成阶段确定观测不确定性。我们介绍并分析了两种方法，这些方法可以将经典的多通道信号处理方法（如延迟和求和波束形成器或Zelinski型维纳滤波器）与不确定性观察技术（如不确定性解码或修正归因）集成在一起。对PASCAL-CHiME任务的结果进行的分析表明，该方法始终优于传统的波束形成器，并且计算复杂性的增加最小。基于观测不确定性的动态补偿的使用也优于传统的静态自适应，无需自适应数据。

著录项

来源
《Computer speech and language》 |2013年第3期|837-850|共14页
作者
Ramon Fernandez Astudillo; Dorothea Kolossa; Alberto Abad; Steffen Zeiler; Rahim Saeidi; Pejman Mowlaee; Joao Paulo da Silva Neto; Rainer Martin;
展开▼
作者单位

Spoken Language Systems Lab, INESC-ID, Lisbon, Portugal;

Institute of Communication Acoustics, Ruhr-Universitaet Bochum, Germany;

Spoken Language Systems Lab, INESC-ID, Lisbon, Portugal;

Institute of Communication Acoustics, Ruhr-Universitaet Bochum, Germany;

Centre for Language and Speech Technology, Radboud University Nijmegen, The Netherlands;

Institute of Communication Acoustics, Ruhr-Universitaet Bochum, Germany;

Spoken Language Systems Lab, INESC-ID, Lisbon, Portugal;

Institute of Communication Acoustics, Ruhr-Universitaet Bochum, Germany;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
robust speech recognition; beamforming; uncertainty decoding; uncertainty propagation;

机译：强大的语音识别;波束成形不确定性解码;不确定性传播;

相似文献

外文文献
中文文献
专利

1. Robust and Efficient DOA-Steered Adaptive MVDR-FROST Beamforming Model for Multi-source Low SNR Environment [J] . Mane Sunita V., Bombale Uttam L. International journal of wireless information networks . 2021,第2期

机译：用于多源低SNR环境的鲁棒和高效的DOA转向自适应MVDR霜霜模型
2. Robust Digital Retrodirective Beamforming Technique for Multipath Channel Environment [J] . Changyoung An, Kukhan Jang, Heung-Gyoon Ryu, Procedia Computer Science . 2015,第1期

机译：用于多径信道环境的稳健数字逆向波束成形技术
3. Distributed Robust Beamforming Based on Low-Rank and Cross-Correlation Techniques: Design and Analysis [J] . IEEE Transactions on Signal Processing . 2019,第24期

机译：基于低秩和互相关技术的分布式鲁棒波束成形：设计与分析
4. Robust Speech Enhancement Techniques for ASR in Non-stationary Noise and Dynamic Environments [C] . Gang Liu, Dimitrios Dimitriadis, Enrico Bocchieri Conference of the International Speech Communication Association . 2013

机译：非静止噪声和动态环境中ASR的强大语音增强技术
5. Robust signal processing techniques for source localization and multisource spatial sound rendering for immersive environments. [D] . Georgiou, Panayiotis G. 2002

机译：强大的信号处理技术，可用于沉浸式环境中的源定位和多源空间声音渲染。
6. Robust Filtering Techniques for RTK Positioning in Harsh Propagation Environments [O] . Daniel Medina, Haoqing Li, Jordi Vilà-Valls, 2021

机译：RTK定位在苛刻的传播环境中的强大过滤技术
7. Robust Digital Retrodirective Beamforming Technique for Multipath Channel Environment [O] . An Changyoung, Jang Kukhan, Ryu Heung-Gyoon, 2015

机译：用于多径信道环境的稳健的数字逆向波束成形技术

Integration of beamforming and uncertainty-of-observation techniques for robust ASR in multi-source environments

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅