...
首页> 外文期刊>Journal of Signal and Information Processing >Noise Removal in Speech Processing Using Spectral Subtraction
【24h】

Noise Removal in Speech Processing Using Spectral Subtraction

机译:使用频谱相减的语音处理中的噪声去除

获取原文
           

摘要

Spectral subtraction is used in this research as a method to remove noise from noisy speech signals in the frequency domain. This method consists of computing the spectrum of the noisy speech using the Fast Fourier Transform (FFT) and subtracting the average magnitude of the noise spectrum from the noisy speech spectrum. We applied spectral subtraction to the speech signal “Real graph”. A digital audio recorder system embedded in a personal computer was used to sample the speech signal “Real graph” to which we digitally added vacuum cleaner noise. The noise removal algorithm was implemented using Matlab software by storing the noisy speech data into Hanning time-widowed half-overlapped data buffers, computing the corresponding spectrums using the FFT, removing the noise from the noisy speech, and reconstructing the speech back into the time domain using the inverse Fast Fourier Transform (IFFT). The performance of the algorithm was evaluated by calculating the Speech to Noise Ratio (SNR). Frame averaging was introduced as an optional technique that could improve the SNR. Seventeen different configurations with various lengths of the Hanning time windows, various degrees of data buffers overlapping, and various numbers of frames to be averaged were investigated in view of improving the SNR. Results showed that using one-fourth overlapped data buffers with 128 points Hanning windows and no frames averaging leads to the best performance in removing noise from the noisy speech.
机译:在本研究中,使用频谱减法作为一种方法来消除频域中嘈杂语音信号的噪声。此方法包括使用快速傅立叶变换(FFT)计算有声语音的频谱,并从有声语音频谱中减去噪声频谱的平均幅度。我们将频谱减法应用于语音信号“实图”。使用嵌入在个人计算机中的数字音频记录器系统对语音信号“实图”进行采样,我们在其中数字地添加了真空吸尘器的噪声。噪声消除算法是使用Matlab软件实现的,方法是将嘈杂的语音数据存储到Hanning时光重叠的半重叠数据缓冲区中,使用FFT计算相应的频谱,从嘈杂的语音中消除噪声,并将语音重新构建为时间快速傅里叶逆变换(IFFT)通过计算语音信噪比(SNR)评估算法的性能。引入帧平均作为可提高SNR的可选技术。为了改善SNR,研究了具有不同长度的汉宁时间窗,不同程度的数据缓冲区重叠以及要平均的各种帧的十七种不同配置。结果表明,使用具有128个点的Hanning窗口且没有帧平均的四分之一重叠数据缓冲区,可以在消除嘈杂语音中的噪声方面达到最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号