首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Bandwidth Extension of Telephone Speech to Low Frequencies Using Sinusoidal Synthesis and a Gaussian Mixture Model
【24h】

Bandwidth Extension of Telephone Speech to Low Frequencies Using Sinusoidal Synthesis and a Gaussian Mixture Model

机译:使用正弦合成和高斯混合模型将电话语音的带宽扩展到低频

获取原文
获取原文并翻译 | 示例

摘要

The quality of narrowband telephone speech is degraded by the limited audio bandwidth. This paper describes a method that extends the bandwidth of telephone speech to the frequency range 0–300 Hz. The method generates the lowest harmonics of voiced speech using sinusoidal synthesis. The energy in the extension band is estimated from spectral features using a Gaussian mixture model. The amplitudes and phases of the synthesized sinusoidal components are adjusted based on the amplitudes and phases of the narrowband input speech, which provides adaptivity to varying input bandwidth characteristics. The proposed method was evaluated with listening tests in combination with another bandwidth extension method for the frequency range 4–8 kHz. While the low-frequency bandwidth extension was not found to improve perceived quality, the method reduced dissimilarity with wideband speech.
机译:有限的音频带宽会降低窄带电话语音的质量。本文介绍了一种将电话语音带宽扩展到0-300 Hz频率范围的方法。该方法使用正弦合成来生成浊音的最低谐波。扩展带中的能量是使用高斯混合模型根据光谱特征估算的。基于窄带输入语音的幅度和相位来调节合成正弦分量的幅度和相位,这为变化的输入带宽特性提供了适应性。结合听觉测试对提出的方法进行了评估,并结合了另一种针对4–8 kHz频率范围的带宽扩展方法。虽然未发现低频带宽扩展可改善感知质量,但该方法减少了宽带语音的相似性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号