Audio bandwidth extension based on temporal smoothing cepstral coefficients

Xin Liu; Chang-Chun Bao

首页> 外文期刊>EURASIP journal on audio, speech, and music processing >Audio bandwidth extension based on temporal smoothing cepstral coefficients

【24h】

Audio bandwidth extension based on temporal smoothing cepstral coefficients

机译：基于时间平滑倒谱系数的音频带宽扩展

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a wideband (WB) to super-wideband audio bandwidth extension (BWE) method based on temporal smoothing cepstral coefficients (TSCC). A temporal relationship of audio signals is included into feature extraction in the bandwidth extension frontend to make the temporal evolution of the extended spectra smoother. In the bandwidth extension scheme, a Gammatone auditory filter bank is used to decompose the audio signal, and the energy of each frequency band is long-term smoothed using minima controlled recursive averaging (MCRA) in order to suppress transient components. The resulting ‘steady-state’ spectrum is processed by frequency weighting, and the temporal smoothing cepstral coefficients are obtained by means of the power-law loudness function and cepstral normalization. The extracted temporal smoothing cepstral coefficients are fed into a Gaussian mixture model (GMM)-based Bayesian estimator to estimate the high-frequency (HF) spectral envelope, while the fine structure is restored by spectral translation. Evaluation results show that the temporal smoothing cepstral coefficients exploit the temporal relationship of audio signals and provide higher mutual information between the low- and high-frequency parameters, without increasing the dimension of input vectors in the frontend of bandwidth extension systems. In addition, the proposed bandwidth extension method is applied into the G.729.1 wideband codec and outperforms the Mel frequency cepstral coefficient (MFCC)-based method in terms of log spectral distortion (LSD), cosh measure, and differential log spectral distortion. Further, the proposed method improves the smoothness of the reconstructed spectrum over time and also gains a good performance in the subjective listening tests.

机译：本文提出了一种基于时间平滑倒频谱系数（TSCC）的宽带（WB）至超宽带音频带宽扩展（BWE）方法。音频信号的时间关系包含在带宽扩展前端的特征提取中，以使扩展频谱的时间演化更加平滑。在带宽扩展方案中，使用Gammatone听觉滤波器组分解音频信号，并使用最小控制递归平均（MCRA）对每个频带的能量进行长期平滑处理，以抑制瞬态分量。通过频率加权处理得到的“稳态”频谱，并通过幂律响度函数和倒频谱归一化获得时间平滑倒频谱系数。提取的时间平滑倒谱系数被馈送到基于高斯混合模型（GMM）的贝叶斯估计器中，以估计高频（HF）频谱包络，而精细结构通过频谱平移得以恢复。评估结果表明，时间平滑倒频谱系数利用了音频信号的时间关系，并在低频和高频参数之间提供了更高的互信息，而没有增加带宽扩展系统前端的输入矢量的维数。此外，所提出的带宽扩展方法已应用于G.729.1宽带编解码器，并且在对数谱失真（LSD），cosh量度和差分对数谱失真方面均优于基于梅尔频率倒谱系数（MFCC）的方法。此外，所提出的方法提高了重建频谱随时间的平滑度，并且在主观听力测试中也获得了良好的性能。

著录项

来源
《EURASIP journal on audio, speech, and music processing》 |2014年第1期|共16页
作者
Xin Liu; Chang-Chun Bao;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Robust audio fingerprinting based on GammaChirp frequency cepstral coefficients and chroma [J] . Electronics Letters . 2014,第4期

机译：基于GammaChirp频率倒谱系数和色度的稳健音频指纹
2. Audio hash function based on non-negative matrix factorisation of mel-frequency cepstral coefficients [J] . Chen N., Xiao H.-D., Wan W. Information Security, IET . 2011,第1期

机译：基于梅尔频率倒谱系数的非负矩阵分解的音频哈希函数
3. Superwideband Bandwidth Extension Using Normalized MDCT Coefficients for Scalable Speech and Audio Coding [J] . Young Han Lee, Seung Ho Choi Advances in multimedia . 2013,第期

机译：使用归一化MDCT系数进行可扩展语音和音频编码的超宽带扩展
4. Mel-Frequency Cepstral Coefficient-Based Bandwidth Extension of Narrowband Speech [C] . Amr H. Nour-Eldin, Peter Kabal International Speech Communication Association . 2008

机译：窄带语音的熔体频率谱系系数的带宽扩展
5. Estimation of cepstral coefficients for robust speech recognition. [D] . Indrebo, Kevin M. 2008

机译：倒频谱系数的估计，用于鲁棒的语音识别。
6. Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features [O] . Ömer Eskidere, Ahmet Gürhanlı 2015

机译：基于多锥梅尔频率倒谱系数特征的语音障碍分类
7. Audio bandwidth extension based on temporal smoothing cepstral coefficients [O] . Xin Liu, Chang-Chun Bao 2014

机译：基于时间平滑倒谱系数的音频带宽扩展

Audio bandwidth extension based on temporal smoothing cepstral coefficients

摘要

著录项

相似文献

相关主题

期刊订阅