Transform representation of the spectra of acoustic speech segments with applications. II. Speech analysis, synthesis, and coding

Algazi V.R.; Brown K.L.

Abstract

For Part I see ibid., vol.1, no.2, p.180-95 (1993). In Part I of this paper, the authors introduced an approach to the representation of the speech spectral envelope which makes use of the Karhunen-Loeve (KL) transformation of acoustic subword segments. This signal-dependent representation captures, with a few KL vectors and transform coefficients, the perceptually and phonetically important structure of the spectral envelope. Here the authors apply this representation to the analysis, synthesis, and coding of speech. They propose simple quantization and coding strategies for the KL representation vectors as well as for the resulting transform coefficients. The resulting technique is a variable rate encoding scheme which achieves good speech quality at an average rate of 3.5 kb/s.
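
The idea of representing a segment's spectral envelopes with a few KL vectors and quantized transform coefficients can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the segmentation, the envelope estimator, or the quantizers, so the synthetic segment, the uniform quantizer, and the names `kl_basis`, `quantize`, and `encode_decode` are all assumptions made for illustration.

```python
import numpy as np

def kl_basis(frames, n_vectors):
    """Return the segment mean and its leading KL (eigen) vectors.

    frames: array of shape (n_frames, n_bins), one spectral envelope per row.
    """
    mean = frames.mean(axis=0)
    centered = frames - mean
    cov = centered.T @ centered / len(frames)
    eigvals, eigvecs = np.linalg.eigh(cov)          # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_vectors]   # keep the largest ones
    return mean, eigvecs[:, order]                  # basis shape: (n_bins, n_vectors)

def quantize(x, step):
    """Uniform scalar quantizer (an assumed stand-in for the paper's scheme)."""
    return np.round(x / step) * step

def encode_decode(frames, n_vectors=3, step=0.25):
    """Project onto the KL basis, quantize the coefficients, and reconstruct."""
    mean, basis = kl_basis(frames, n_vectors)
    coeffs = (frames - mean) @ basis
    return mean + quantize(coeffs, step) @ basis.T

# Synthetic "segment": envelopes built from two smooth shapes plus small noise.
rng = np.random.default_rng(0)
n_frames, n_bins = 20, 32
bins = np.linspace(0.0, 1.0, n_bins)
shapes = np.stack([np.cos(np.pi * bins), np.cos(2 * np.pi * bins)])
weights = rng.normal(size=(n_frames, 2))
frames = 10.0 + weights @ shapes + 0.01 * rng.normal(size=(n_frames, n_bins))

for k in (1, 2, 3):
    rec = encode_decode(frames, n_vectors=k)
    rmse = np.sqrt(np.mean((frames - rec) ** 2))
    print(f"KL vectors kept: {k}, reconstruction RMSE: {rmse:.4f}")
```

Because the basis is computed per segment, a real coder would have to transmit the KL vectors as well as the coefficients, which is why the abstract mentions quantization strategies for both.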

Bibliographic record

Source: IEEE Transactions on Speech and Audio Processing, 1993, no. 3, pp. 277-286 (10 pages)
Authors: Algazi V.R.; Brown K.L.
Original format: PDF
Language: English
CLC classification: Electroacoustic technology and speech signal processing

Similar literature

1. Transform representation of the spectra of acoustic speech segments with applications. I. General approach and application to speech recognition [J]. Algazi V.R., Brown K.L. IEEE Transactions on Speech and Audio Processing. 1993, no. 2
2. Evaluation of noisy speech recognition by transform of the spectral envelope using analysis-synthesis system [J]. Masanori Akita, Naoki Tsuboe, Yohichi Midorikawa. 電子情報通信学会技術研究報告. 音声. Speech. 2000, no. 467
3. A phonetically labeled acoustic segment (PLAS) approach to speech analysis-synthesis [C]. Soong, F.K. 1989
4. Acoustic Representations of Segmental and Metrical Encoding in Speech Production [D]. Myers, Brett R. 2019
5. N1 Repetition-Attenuation for Acoustically Variable Speech and Spectrally Rotated Speech [O]. Ellen Marklund, Lisa Gustavsson, Petter Kallioinen. 2020
6. Neural markers of speech comprehension: measuring EEG tracking of linguistic speech representations, controlling the speech acoustics [O]. Marlies Gillis, Jonas Vanthornhout, Jonathan Z. Simon. 2021