Conference: Pacific-Rim conference on multimedia
Low Bitrates Audio Bandwidth Extension Using a Deep Auto-Encoder

Abstract

Modern audio coding technologies apply bandwidth extension (BWE) methods to represent audio data efficiently at low bitrates. An established method is spectral band replication (SBR), which delivers very high sound quality with imperceptible artifacts. However, its bitrate and complexity are high. Another method is LPC-based BWE, which is part of the 3GPP AMR-WB+ codec. Although its bitrate and complexity are markedly lower, the sound quality it provides is unsatisfactory for music. This paper proposes a novel bandwidth extension method that provides sound quality close to eSBR at a bitrate of only 0.8 kbps. The proposed method predicts the fine structure of the high-frequency band from the low-frequency band using a deep auto-encoder, and extracts only the envelope of the high-frequency band as side information. The performance evaluation demonstrates the advantage of the proposed method over the state of the art. Compared with eSBR, the bitrate drops by about 63%, while the subjective listening quality remains close. Compared with LPC-based BWE, the subjective listening quality is better at the same bitrate.
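The split between a transmitted envelope and a predicted fine structure can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not specify the transform, the band layout, or the network. The function names, the equal-width bands, and the RMS envelope are hypothetical. The predictor is a plain copy-up placeholder standing in for the trained deep auto-encoder, which is not reproduced here.

```python
import math


def band_envelope(coeffs, n_bands):
    """RMS value of each equal-width sub-band (an assumed envelope definition)."""
    width = len(coeffs) // n_bands
    return [
        math.sqrt(sum(c * c for c in coeffs[b * width:(b + 1) * width]) / width)
        for b in range(n_bands)
    ]


def fine_structure(coeffs, n_bands, eps=1e-12):
    """Coefficients divided by their band envelope, giving roughly unit RMS per band."""
    width = len(coeffs) // n_bands
    env = band_envelope(coeffs, n_bands)
    return [c / (env[i // width] + eps)
            for i, c in enumerate(coeffs[:width * n_bands])]


def predict_high_fine(low_fine):
    """Placeholder predictor: copies the low-band fine structure upward.

    The paper uses a trained deep auto-encoder at this step; this stand-in
    is NOT that model and only keeps the sketch runnable.
    """
    return list(low_fine)


def reconstruct_high_band(low_coeffs, high_env, n_bands):
    """Decoder side: predict the fine structure, then shape it with the side info."""
    low_fine = fine_structure(low_coeffs, n_bands)
    high_fine = predict_high_fine(low_fine)
    # Renormalise so each predicted band has unit RMS before applying the envelope.
    high_fine = fine_structure(high_fine, n_bands)
    width = len(high_fine) // n_bands
    return [f * high_env[i // width] for i, f in enumerate(high_fine)]


# Encoder side: only the high-band envelope is sent as side information.
low = [1.0, -1.0, 2.0, -2.0, 0.5, 0.5, 3.0, -3.0]
high = [0.4, -0.4, 0.3, 0.3, 0.2, -0.2, 0.1, 0.1]
side_info = band_envelope(high, 4)
rebuilt = reconstruct_high_band(low, side_info, 4)
```

In this sketch the rebuilt high band matches the original envelope band by band, while its sign pattern comes entirely from the predictor. That is the division of labour the abstract describes: the side information carries only the coarse shape, which is what keeps its bitrate low.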
