Étude de transformées temps-fréquence pour le codage audio faible retard en haute qualité

Machine translation: Study of time-frequency transforms for high-quality, low-delay audio coding

Abstract

In recent years there has been a phenomenal increase in the number of products and applications that make use of audio coding formats. Among the most successful audio coding schemes are MPEG-1 Layer III (mp3), MPEG-2 Advanced Audio Coding (AAC), and its evolution, MPEG-4 High Efficiency Advanced Audio Coding (HE-AAC). More recently, perceptual audio coding has been adapted to operate at low delay so as to suit conversational applications. A filter bank such as the Modified Discrete Cosine Transform (MDCT) is traditionally a central component of perceptual audio coding, and its adaptation to low-delay audio coding has become an important research topic. Low-delay transforms have been developed to retain the performance of standard audio coding while dramatically reducing the associated algorithmic delay. This work presents several elements that better accommodate the delay reduction constraint. Among the contributions is a low-delay block switching tool that allows a direct transition between long and short transforms without inserting a transition window. The same principle has been extended to define new perfect reconstruction conditions for the MDCT, with relaxed constraints compared to the original definition. As a consequence, a seamless reconstruction method has been derived that increases the flexibility of transform coding schemes by making it possible to select the transform for a frame independently of its neighbouring frames. Finally, based on this new approach, a new low-delay window design procedure has been derived, yielding an analytic definition of a new family of transforms that permits high quality with a substantial reduction in coding delay.
The performance of the proposed transforms has been thoroughly evaluated, and an evaluation framework involving an objective measurement of the optimal transform sequence is proposed; it confirms the relevance of the proposed transforms for audio coding. In addition, the new approaches have been successfully applied to recent standardisation work items, such as the low-delay audio coding developed at MPEG (LD-AAC and ELD-AAC), and they have been evaluated in numerous subjective tests, showing a significant quality improvement for transient signals. The new low-delay window design has been adopted in G.718, a scalable speech and audio codec standardised by ITU-T, where it has demonstrated its benefit in terms of delay reduction while maintaining the audio quality of a traditional MDCT.

Bibliographic record

  • Author

    Virette, David

  • Author affiliation
  • Year: 2012
  • Total pages
  • Original format: PDF
  • Language of text: en
  • Chinese Library Classification
