Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model

George E.B.; Smith M.J.T.

首页> 外文期刊>IEEE Transactions on Speech and Audio Proceeding >Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model

【24h】

Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model

机译：语音分析/合成和使用合成分析/重叠叠加正弦模型的修改

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sinusoidal modeling has been successfully applied to a broad range of speech processing problems, and offers advantages over linear predictive modeling and the short-time Fourier transform for speech analysis/synthesis and modification. This paper presents a novel speech analysis/synthesis system based on the combination of an overlap-add sinusoidal model with an analysis-by-synthesis technique to determine the model parameters. It describes this analysis procedure in detail, and introduces an equivalent frequency-domain algorithm that takes advantage of the computational efficiency of the fast Fourier transform (FFT). In addition, a refined overlap-add sinusoidal model capable of shape-invariant speech modification is derived, and a pitch-scale modification algorithm is defined that preserves speech bandwidth and eliminates noise migration effects. Analysis-by-synthesis achieves very high synthetic speech quality by accurately estimating the component frequencies, eliminating sidelobe interference effects, and effectively dealing with nonstationary speech events. The refined overlap-add synthesis model correlates well with analysis-by-synthesis, and modifies speech without objectionable artifacts by explicitly controlling shape invariance and phase coherence. The proposed analysis-by-synthesis/overlap-add (ABS/OLA) system allows for both fixed and time-varying time-, frequency-, and pitch-scale modifications, and computational shortcuts using the FFT algorithm make its implementation feasible using currently available hardware.

机译：正弦建模已成功应用于广泛的语音处理问题，与线性预测建模和用于语音分析/合成和修改的短时傅立叶变换相比，具有优势。本文提出了一种新颖的语音分析/合成系统，该系统基于重叠叠加正弦模型与合成分析技术确定模型参数的结合。它详细描述了此分析过程，并介绍了一种等效的频域算法，该算法利用了快速傅立叶变换（FFT）的计算效率。此外，推导了能够进行形状不变的语音修改的精巧重叠叠加正弦模型，并定义了一种音调比例修改算法，该算法可保留语音带宽并消除噪声迁移效应。通过精确地估计分量频率，消除旁瓣干扰效应以及有效地处理非平稳语音事件，“按合成分析”可实现很高的合成语音质量。改进的重叠叠加合成模型与综合分析密切相关，并通过显式控制形状不变性和相位相干性来修改语音而没有令人讨厌的伪影。拟议的综合分析/重叠分析（ABS / OLA）系统允许进行固定和时变的时间，频率和音调比例修改，并且使用FFT算法的计算快捷方式使其在当前的情况下可行可用的硬件。

著录项

来源
《IEEE Transactions on Speech and Audio Proceeding》 |1997年第5期|P.389-406|共18页
作者
George E.B.; Smith M.J.T.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电声技术和语音信号处理;
关键词

相似文献

外文文献
中文文献
专利

1. Speech analysis/synthesis and modification using ananalysis-by-synthesis/overlap-add sinusoidal model [J] . George E.B., Smith M.J.T. IEEE Transactions on Speech and Audio Proceessing . 1997,第5期

机译：语音分析/合成和修饰，使用合成分析/重叠正弦模型
2. Analysis-by-Synthesis Sinusoidal Model without an Overlapping Scheme [J] . Jong-Hark KIM, Gyu-Hyeok JEONG, In-Sung LEE IEICE Transactions on Communications . 2008,第6期

机译：无重叠方案的综合分析正弦模型
3. A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model [J] . Sankaran Panchapagesan, Abeer Alwan The Journal of the Acoustical Society of America . 2011,第4期

机译：使用链矩阵和前田发音模型通过合成分析对语音进行语音到发音发音转换的研究
4. Speech coding with an analysis-by-synthesis sinusoidal model [C] . Etemoglu, C.O., Cuperman, . 2000

机译：带有综合分析正弦模型的语音编码
5. Analysis-by-synthesis multimode harmonic speech coding at low bit rate. [D] . Li, Chunyan. 2000

机译：低比特率的综合分析多模谐波语音编码。
6. A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model [O] . Sankaran Panchapagesan, Abeer Alwan -1

机译：使用链矩阵和前田发音模型通过合成分析对语音进行语音到发音发音转换的研究
7. Speech Coding With An Analysis-By-Synthesis Sinusoidal Model [O] . Ca Gr Ozgenc, Vladimir Cuperman, Allen Gersho 2000

机译：综合分析正弦模型的语音编码
8. Selective Linear Prediction and Analysis-by-Synthesis in Speech Analysis [R] . Makhoul, J. 1974

机译：语音分析中的选择性线性预测和合成分析

Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model

摘要

著录项

相似文献

相关主题

期刊订阅