...
首页> 外文期刊>Turkish Journal of Electrical Engineering and Computer Sciences >Bandwidth extension of narrowband speech in log spectra domain using neural network
【24h】

Bandwidth extension of narrowband speech in log spectra domain using neural network

机译:基于神经网络的对数谱域窄带语音带宽扩展

获取原文

摘要

In recent years, there have been significant advances in communication technology, but speech signals still suffer from low perceived quality caused by bandwidth limitations of telephone networks. The bandwidth extension (BWE) approach adds high-frequency components of the speech signal to band-limited telephone speech and increases speech perception significantly. In this work, we develop a new method for representation of vocal tract filter coefficients using log of filter bank energy (LFBE) parameters as an alternative for mel-frequency cepstral coefficients (MFCCs). This approach is based on a strong correlation between the spectral components of low- and high-band spectrums. Furthermore, the performances of Gaussian mixture model and multilayer perceptron neural network methods for estimation of the high-frequency envelope are evaluated. Objective evaluations of the obtained results indicate that the LFBE feature vectors have better performance than the MFCCs. In addition, findings of the objective evaluations showed that using a neural network, which is not common in BWE, achieves a better performance as compared to the Gaussian mixture model.
机译:近年来,通信技术取得了长足的进步,但是语音信号仍然遭受由于电话网络的带宽限制而导致的低感知质量。带宽扩展(BWE)方法将语音信号的高频分量添加到带宽受限的电话语音中,并显着增加了语音感知。在这项工作中,我们使用滤波器组能量(LFBE)参数的对数作为梅尔频率倒谱系数(MFCCs)的替代方法,开发了一种表示声道滤波器系数的新方法。该方法基于低频段和高频段频谱的频谱分量之间的强相关性。此外,评估了高斯混合模型和多层感知器神经网络方法估计高频包络的性能。对所获得结果的客观评估表明,LFBE特征向量比MFCC具有更好的性能。此外,客观评估的结果表明,与高斯混合模型相比,使用BWE中不常见的神经网络可获得更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号