【24h】

Binary spectral masking for speech recognition systems

机译:语音识别系统的二进制频谱屏蔽

获取原文
获取原文并翻译 | 示例

摘要

The purpose of this paper is to examine the use of spectral masking techniques as a preprocessing step in speech recognition systems. The limits of these masking techniques for different levels of the signal-to-noise ratio are discussed. In general, speech recognition systems have low performance in noisy environments or in the presence of other speech signals. This work presents a blind source separation system based on ideal binary masks to deal with real situations in which speech signals are corrupted by noise, including other speech signals. The main contribution of this work is to analyze the performance limits of recognition systems using spectral masking. We obtain an increase of 18 % on the speech hit rate and an average gain of 10 dB in terms of noise level attenuation, when the speech signals were corrupted by other voice signals, with different signal-to-noise ratio of approximately 1, 10 and 20 dB.
机译:本文的目的是研究将频谱屏蔽技术用作语音识别系统中的预处理步骤。讨论了这些掩蔽技术对信噪比不同级别的限制。通常,语音识别系统在嘈杂的环境中或存在其他语音信号时性能低下。这项工作提出了一种基于理想二进制掩码的盲源分离系统,用于处理语音信号被噪声(包括其他语音信号)破坏的实际情况。这项工作的主要贡献是使用频谱掩膜来分析识别系统的性能极限。当语音信号被其他语音信号破坏时,语音命中率提高了18%,噪音水平衰减平均增益为10 dB,信噪比大约为1,10和20 dB。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号