Variable Rate Characteristic Waveform Interpolation Speech Coder Based on Phonetic Classification

WANG Jing; KUANG Jing-ming; ZHAO Sheng-hui

首页> 外文期刊>Journal of Beijing Institute of Technology >Variable Rate Characteristic Waveform Interpolation Speech Coder Based on Phonetic Classification

【24h】

Variable Rate Characteristic Waveform Interpolation Speech Coder Based on Phonetic Classification

机译：基于语音分类的可变速率特征波形内插语音编码器

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A variable-bit-rate characteristic waveform interpolation (VBR-CWI) speech codec with about 1.8 kbit/s average bit rate which integrates phonetic classification into characteristic waveform (CW) decomposition is proposed. Each input frame is classified into one of 4 phonetic classes. Non-speech frames are represented with Bark-band noise model. The extracted CWs become rapidly evolving waveforms (REWs) or slowly evolving waveforms (SEWs) in the cases of unvoiced or stationary voiced frames respectively, while mixed voiced frames use the same CW decomposition as that in the conventional CWI. Experimental results show that the proposed codec can eliminate most buzzy and noisy artifacts existing in the fixed-bit-rate characteristic waveform interpolation (FBR-CWI) speech codec, the average bit rate can be much lower, and its reconstructed speech quality is much better than FS 1 016 CELP at 4.8 kbit/s and similar to G.723.1 ACELP at 5.3 kbit/s.

机译：提出了一种平均比特率约为1.8 kbit / s的可变比特率特征波形内插（VBR-CWI）语音编解码器，它将语音分类集成到特征波形（CW）分解中。每个输入帧被分为4个语音类别之一。非语音帧用Bark-band噪声模型表示。分别在无声或固定浊音帧的情况下，提取的CW变为快速演化波形（REW）或缓慢演化波形（SEW），而混合浊音帧则使用与常规CWI中相同的CW分解。实验结果表明，所提出的编解码器可以消除固定比特率特征波形插值（FBR-CWI）语音编解码器中存在的大多数嗡嗡声和杂音，平均比特率可以更低，并且其重构语音质量更好。比FS 1 016 CELP的速率为4.8 kbit / s，类似于G.723.1 ACELP的速率为5.3 kbit / s。

著录项

来源
《Journal of Beijing Institute of Technology》 |2007年第2期|p.187-192|共6页
作者
WANG Jing; KUANG Jing-ming; ZHAO Sheng-hui;
展开▼
作者单位

School of Information Science and Technology, Beijing Institute of Technology, Beijing 100081, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类一般工业技术;
关键词
variable bit rate speech coding; characteristic waveform interpolation; phonetic classification;

机译：可变比特率语音编码;特征波形插值;语音分类;

相似文献

外文文献
中文文献
专利

1. Variable frame rate speech coding using optimal interpolation [J] . Chii-Jen Chung, Sin-Horng Chen IEEE Transactions on Communications . 1994,第6期

机译：使用最优插值的可变帧率语音编码
2. Voiced speech coding at very low bit rates based onforward-backward waveform prediction [J] . Gao Yang, Leich H., Boite R. IEEE Transactions on Speech and Audio Proceessing . 1995,第1期

机译：基于前向-后向波形预测的非常低比特率的浊音编码
3. Voiced speech coding at very low bit rates based on forward-backward waveform prediction [J] . Gao Yang, Leich H. IEEE Transactions on Speech and Audio Proceeding . 1995,第1期

机译：基于前向后波形预测的非常低比特率的浊音编码
4. A new variable rate speech coder based on fuzzy phonetic classification and CS-acelp structure [C] . Beritelli, F. International conference on signal processing;ICSP '98 . 1998

机译：基于模糊语音分类和CS-acelp结构的新型可变速率语音编码器
5. Variable rate speech coding based on subband measures of spectral flatness [D] . McClellan, Stan 1995

机译：基于频谱平坦度子带测度的可变速率语音编码
6. Inter-rater Reliability of 4-Item Arterial Doppler Waveform Classification System for Description of Arterial Doppler Waveforms [O] . Rui Zhao, Damien Lanéelle, Meiying Gao, 2020

机译：4项动脉多普勒波形分类系统的帧间间可靠性用于说明动脉多普勒波形
7. Very low rate speech coding using temporal decomposition and waveform interpolation [O] . Ritz, C. H., Burnett, I., Lukasiak, J 2000

机译：使用时间分解和波形插值的超低速语音编码
8. Simulation and Evaluation of Phonetic Speech Recognition Techniques. Volume III. Acoustical Characteristics of Speech Sounds Systematically Arranged in Form of Tables [R] . Otten, K. W. 1964

机译：语音识别技术的仿真与评估。第三卷。以表格形式系统地排列的语音的声学特征

Variable Rate Characteristic Waveform Interpolation Speech Coder Based on Phonetic Classification

摘要

著录项

相似文献

相关主题

期刊订阅