【24h】

Speech Enhancement and Encoding using SS-VAD and LPC

机译:使用SS-VAD和LPC进行语音增强和编码

获取原文

摘要

In this paper, an enhancement of noisy speech data and encoding of corrupted and enhanced speech data is presented. An algorithm is proposed which is an amalgamation of spectral subtraction with voice activity detection (SS-VAD) and linear predictive coding (LPC) for speech enhancement and encoding purpose. Firstly, the significance of SS-VAD and LPC methods are studied in detail for various types of noises. In SS-V AD technique, the noisy speech data is considered as an input speech signal which is a combination of clean speech data and noise model. The corrupted speech data is windowed using Hanning window and framed for every 20ms. 50% of overlapping is done while windowing the speech data. The output of SS-VAD is given as an input to the LPC encoder. The coefficients are extracted from the input speech data to design all pole filters. The cross correlation process is also done for differentiating the voiced and unvoiced samples at the analysis step. The pitch information and extracted coefficients are used at the synthesis step. The experiments are conducted for different types of noisy speech data which are degraded by background noise, F16 noise, factory noise, white noise and car noise. The experimental results show that an SS-VAD algorithm significantly improved the signal to noise ratio (SNR) of speech data under various degraded conditions. Therefore, the SS-VAD algorithm is combined with LPC for encoding of enhanced speech data which yields better audibility and intelligibility of speech compared to encoding of noisy speech data.
机译:在本文中,提高了嘈杂的语音数据和损坏和增强语音数据的编码。提出了一种算法,其是具有语音活动检测(SS-VAD)和线性预测编码(LPC)的光谱减法的融合,用于语音增强和编码目的。首先,对各种类型的噪声详细研究了SS-VAD和LPC方法的重要性。在SS-V AD技术中,嘈杂的语音数据被认为是输入语音信号,其是清洁语音数据和噪声模型的组合。使用Hanning窗口窗口窗口窗口窗口并为每20ms框架框架进行窗口。在窗口窗口语音数据时完成了50%的重叠。 SS-VAD的输出作为LPC编码器的输入给出。从输入语音数据中提取系数以设计所有极滤波器。还进行了交叉相关过程,用于在分析步骤中区分浊音和清晰的样品。在合成步骤中使用间距信息和提取的系数。实验是针对不同类型的噪声语音数据进行,这些语音数据被背景噪声,F16噪声,工厂噪声,白噪声和汽车噪声劣化。实验结果表明,SS-VAD算法在各种降解条件下显着改善了语音数据的信噪比(SNR)。因此,SS-VAD算法与LPC组合用于编码增强语音数据,与噪声语音数据的编码相比,对语音的更好的可听性和可懂度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号