Binary spectral masking for speech recognition systems

Abstract

This paper examines spectral masking as a preprocessing step in speech recognition systems and discusses the limits of these masking techniques at different signal-to-noise ratios (SNRs). In general, speech recognition systems perform poorly in noisy environments or in the presence of other speech signals. This work presents a blind source separation system based on ideal binary masks to deal with real situations in which speech signals are corrupted by noise, including other speech signals. The main contribution of this work is an analysis of the performance limits of recognition systems that use spectral masking. We obtain an increase of 18% in the speech hit rate and an average gain of 10 dB in noise level attenuation when the speech signals are corrupted by other voice signals at SNRs of approximately 1, 10 and 20 dB.
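
The abstract does not give implementation details, so the following is only a minimal sketch of the general ideal-binary-mask idea, not the authors' system. It assumes NumPy, a simple Hann-windowed STFT, a 0 dB local criterion, and two toy sinusoids standing in for the target and the competing speech; all function names and parameter values here are illustrative assumptions. The mask is called "ideal" because it is computed from the premixed target and interferer, which is why it is typically used to study performance limits rather than as a deployable separator.

```python
import numpy as np

def stft(x, frame_len=256, hop=128):
    """Hann-windowed short-time Fourier transform, shape (frames, bins)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def ideal_binary_mask(target_spec, interferer_spec, lc_db=0.0, eps=1e-12):
    """Return 1 where the local target-to-interferer ratio exceeds lc_db, else 0."""
    target_power = np.abs(target_spec) ** 2
    interferer_power = np.abs(interferer_spec) ** 2
    local_snr_db = 10.0 * np.log10((target_power + eps) / (interferer_power + eps))
    return (local_snr_db > lc_db).astype(float)

def energy_ratio_db(a_spec, b_spec):
    """Ratio of total spectrogram energies in dB."""
    return 10.0 * np.log10(np.sum(np.abs(a_spec) ** 2) / np.sum(np.abs(b_spec) ** 2))

# Toy stand-ins for target speech and competing speech (assumed values).
fs = 8000
t = np.arange(fs) / fs
target = np.sin(2 * np.pi * 440 * t)
interferer = 0.5 * np.sin(2 * np.pi * 1800 * t)

target_spec = stft(target)
interferer_spec = stft(interferer)
mask = ideal_binary_mask(target_spec, interferer_spec)

# The STFT is linear, so masking the mixture equals masking each component.
snr_before = energy_ratio_db(target_spec, interferer_spec)
snr_after = energy_ratio_db(mask * target_spec, mask * interferer_spec)
print(f"SNR before masking: {snr_before:.1f} dB")
print(f"SNR after masking:  {snr_after:.1f} dB")
```

In a recognition pipeline the masked spectrogram would then be passed to feature extraction. How the mask is estimated blindly, without access to the separate sources, is the hard part and is not covered by this sketch.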

Bibliographic details

Source: 2012 35th International Conference on Telecommunications and Signal Processing (TSP), 2012, pp. 432-436 (5 pages)
Conference location: Prague (CZ)
Authors: Versiani Thiago de Souza Siqueira; Rodrigues Gustavo Fernandes; Souza Ana Claudia Silva de; Moreira Jussara de Matos; Yehia Hani Camille
Author affiliation: Federal University of São João del-Rey, Ouro Branco, Minas Gerais, Brazil
Original format: PDF
Language: English
Chinese Library Classification: Signal processing

Similar literature

1. Constraints on ideal binary masking for the perception of spectrally-reduced speech [J]. Montazeri Vahid, Assmann Peter F. The Journal of the Acoustical Society of America. 2018, No. 1
2. The role of binary mask patterns in automatic speech recognition in background noise [J]. Narayanan A., Wang D. The Journal of the Acoustical Society of America. 2013, No. 5aPta1
3. Robust speech recognition from binary masks [J]. Narayanan A., Wang D. The Journal of the Acoustical Society of America. 2010, No. 5
4. Binary spectral masking for speech recognition systems [C]. Versiani Thiago de Souza Siqueira, Rodrigues Gustavo Fernandes, Souza Ana Claudia Silva de. International Conference on Telecommunications and Signal Processing. 2012
5. ASR-driven binary mask estimation for robust automatic speech recognition [D]. Hartmann, William. 2012
6. The role of binary mask patterns in automatic speech recognition in background noise [O]. Arun Narayanan, DeLiang Wang
7. Constraints on ideal binary masking for the perception of spectrally-reduced speech [O]. Vahid Montazeri, Peter F. Assmann. 2018
