Total Least Squares (TLS) algorithms automatically decompose (audio) frames into a number of exponentially damped sinusoids. This can yield more efficient modeling than plain sinusoidal modeling, especially for transitional frames. Straightforward implementations of TLS optimize an SNR criterion. In our implementation we apply TLS in a subband scheme in which the number of damped sinusoids is both frame and subband dependent. This is made possible through the use of perceptual information provided by the MPEG-I psycho-acoustic model I. Experiments on different audio tracks provide proof of concept for our perceptual ESM, and illustrate the significant reduction in modeling components compared to a non-perceptual ESM.
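To make the decomposition step concrete, the following is a minimal sketch of one common TLS-based estimator for exponentially damped sinusoids: a Hankel/shift-invariance formulation solved in the total least squares sense. It is offered as an illustration under stated assumptions, not as the implementation used in this work. The function names `tls_poles` and `fit_amplitudes` are hypothetical, the model order `K` is fixed by hand, and the subband splitting and perceptual order selection described above are not included.

```python
import numpy as np

def tls_poles(x, K):
    """Estimate K complex poles z_k of the model x[n] = sum_k c_k * z_k**n.

    Hypothetical helper: a generic Hankel-TLS sketch, not the authors' code.
    A real damped sinusoid contributes a conjugate pair, so it needs K = 2.
    """
    x = np.asarray(x, dtype=complex)
    N = len(x)
    L = N // 2
    # Hankel data matrix with H[i, j] = x[i + j]
    H = np.array([x[i:i + N - L + 1] for i in range(L)])
    # Signal subspace: the K dominant left singular vectors
    U, _, _ = np.linalg.svd(H, full_matrices=False)
    Uk = U[:, :K]
    U_up, U_down = Uk[:-1], Uk[1:]
    # Total least squares solution of U_up @ Z ~= U_down,
    # read off the right singular vectors of the stacked matrix
    _, _, Vh = np.linalg.svd(np.hstack([U_up, U_down]))
    V = Vh.conj().T
    V12, V22 = V[:K, K:], V[K:, K:]
    Z = -V12 @ np.linalg.inv(V22)
    # The eigenvalues of Z are the poles exp(-damping + 1j * frequency)
    return np.linalg.eigvals(Z)

def fit_amplitudes(x, poles):
    """Least-squares complex amplitudes c_k for the given poles."""
    x = np.asarray(x, dtype=complex)
    # Column k of the basis holds z_k**n for n = 0 .. N-1
    basis = np.vander(poles, len(x), increasing=True).T
    return np.linalg.lstsq(basis, x, rcond=None)[0]
```

For a noise-free frame such as `np.exp(-0.05 * n) * np.cos(0.3 * n)`, calling `tls_poles(x, 2)` should return a conjugate pole pair whose log-magnitude gives the damping and whose angle gives the frequency in radians per sample. In a perceptual scheme, `K` would be chosen per frame and per subband rather than fixed.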