首页> 外文期刊>IEEE Transactions on Speech and Audio Proceeding >Transform representation of the spectra of acoustic speech segments with applications. II. Speech analysis, synthesis, and coding
【24h】

Transform representation of the spectra of acoustic speech segments with applications. II. Speech analysis, synthesis, and coding

机译:借助应用程序来变换语音片段的频谱表示。二。语音分析,合成和编码

获取原文
获取原文并翻译 | 示例

摘要

For Part I see ibid., vol.1, no.2, p.180-95 (1993). In Part I of this paper, the authors introduced an approach to the representation of the speech spectral envelope which makes use of the Karhunen-Loeve (KL) transformation of acoustic subword segments. This signal-dependent representation captures, with a few KL vectors and transform coefficients, the perceptually and phonetically important structure of the spectral envelope. Here the authors apply this representation to the analysis, synthesis, and coding of speech. They propose simple quantization and coding strategies for the KL representation vectors as well as for the resulting transform coefficients. The resulting technique is a variable rate encoding scheme which achieves good speech quality at an average rate of 3.5 kb/s.
机译:对于第一部分,请参见同上,第1卷,第2期,第180-95页(1993)。在本文的第一部分中,作者介绍了一种利用语音子词段的Karhunen-Loeve(KL)变换来表示语音频谱包络的​​方法。这种依赖于信号的表示形式,利用一些KL向量和变换系数,捕获了频谱包络的​​在感知和语音上重要的结构。在这里,作者将这种表示形式应用于语音的分析,合成和编码。他们为KL表示向量以及由此产生的变换系数提出了简单的量化和编码策略。最终的技术是一种可变速率编码方案,该方案以3.5 kb / s的平均速率实现了良好的语音质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号