Perceptual speech coding and enhancement using frame-synchronized fast wavelet packet transform algorithms

Carnero B.; Drygajlo A.

Abstract

This paper presents new wideband speech coding and integrated speech coding-enhancement systems based on frame-synchronized fast wavelet packet transform algorithms. It also formulates temporal and spectral psychoacoustic models of masking adapted to wavelet packet analysis. The algorithm of the proposed FFT-like overlapped block orthogonal wavelet packet transform permits us to efficiently approximate the auditory critical band decomposition in the time and frequency domains. This allows us to make use of the temporal and spectral masking properties of the human auditory system to decrease the average bit rate of the encoder while perceptually hiding the quantization error. The same wavelet packet representation is used to merge speech enhancement and coding in the context of auditory modeling. The advantage of the method presented in this paper over previous approaches is that perceptual enhancement and coding, which are usually implemented as a cascade of two separate systems, are combined. This leads to a decreased computational load. Experiments show that the proposed wideband coding procedure by itself can achieve transparent coding of speech signals sampled at 16 kHz at an average bit rate of 39.4 kbit/s. The combined speech coding-enhancement procedure achieves higher bit rate values that depend on the residual noise characteristics at the output of the enhancement process.
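
The idea of approximating critical bands with an orthogonal wavelet packet tree can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a plain Haar filter pair, no block overlap, and a hypothetical tree shape (`TREE`) that simply splits the lower half of the spectrum more finely than the upper half. The function names `haar_split` and `wavelet_packet` are illustrative only.

```python
import math

def haar_split(x):
    """One orthogonal Haar analysis step: returns (lowpass, highpass) halves."""
    s = 1.0 / math.sqrt(2.0)
    half = len(x) // 2
    low = [s * (x[2 * i] + x[2 * i + 1]) for i in range(half)]
    high = [s * (x[2 * i] - x[2 * i + 1]) for i in range(half)]
    return low, high

def wavelet_packet(x, tree):
    """Decompose x along a binary tree.

    tree is None for a leaf, or a (low_subtree, high_subtree) pair.
    Returns the leaf subbands in tree order (not strictly frequency order).
    """
    if tree is None:
        return [x]
    low, high = haar_split(x)
    return wavelet_packet(low, tree[0]) + wavelet_packet(high, tree[1])

# Hypothetical non-uniform tree (an assumption, not taken from the paper):
# the lower half-band is split into four narrow bands, the upper half-band
# into two wider ones, loosely mimicking bandwidths that grow with frequency.
TREE = (((None, None), (None, None)), (None, None))

frame = [math.sin(0.3 * n) for n in range(16)]  # one 16-sample test frame
bands = wavelet_packet(frame, TREE)
print([len(b) for b in bands])  # [2, 2, 2, 2, 4, 4]
```

Because each Haar step is orthogonal, the total energy of the subband coefficients equals the energy of the input frame, which is the property that lets quantization noise be budgeted per band against a masking threshold.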

Bibliographic details

Source
IEEE Transactions on Signal Processing | 1999, No. 6 | pp. 1622-1635 | 14 pages
Authors
Carnero B.; Drygajlo A.
Original format: PDF
Language: English
Chinese Library Classification: Communication theory

Similar literature

1. Speech enhancement using perceptually-constrained gain factors in critical-band-wavelet-packet transform [J]. C.-T. Lu, H.-C. Wang. Electronics Letters, 2004, No. 6.
2. Perceptual Wavelet packet transform based Wavelet Filter Banks Modeling of Human Auditory system for improving the intelligibility of voiced and unvoiced speech: A Case Study of a system development [J]. Ranganadh Narayanam. International Journal on Computer Science and Engineering, 2015, No. 10.
3. Adaptive Variable Degree-k Zero-Trees for Re-Encoding of Perceptually Quantized Wavelet Packet Transformed Audio and High-Quality Speech [J]. Omid Ghahabi, Mohammad Hassan Savoji. International Scholarly Research Notices, 2011, No. 5.
4. Perceptual coding of speech using a fast wavelet packet transform algorithm [C]. Carnero Benito, Drygajlo Andrzej. European Signal Processing Conference, 1996.
5. Algorithms for wavelet transforms and adaptive wavelet packet decompositions [D]. Taswell, Carl. 1995.
6. Identity Vector Extraction by Perceptual Wavelet Packet Entropy and Convolutional Neural Network for Voice Authentication [O]. Lei Lei, Kun She. 2018.
7. Perceptual coding of speech using a fast wavelet packet transform algorithm [O]. Carnero B., Drygajlo A. 1996.
8. Comparison of Arithmetic Requirements for the PFA (Prime Factor Algorithm), WFTA (Winograd Fourier Transform Algorithm), SWIFT, MFFT (Mixed Radix Fast Fourier Transform), FFT (Fast Fourier Transform) and DFT (Discrete Fourier Transform) Algorithms [R]. Hicks, R. C. 1982.
