...
首页> 外文期刊>IEICE transactions on information and systems >Enhancing Stereo Signals with High-Order Ambisonics Spatial Information
【24h】

Enhancing Stereo Signals with High-Order Ambisonics Spatial Information

机译:利用高阶Ambisonics空间信息增强立体声信号

获取原文
           

摘要

There is a strong push towards the ultra-realistic presentation of multimedia contents made possible by the latest advances in computational and signal processing technologies. Three-dimensional sound presentation is necessary to convey a natural and rich multimedia experience. Promising ways to achieve this include the sound field reproduction technique known as high-order Ambisonics (HOA). While these advanced methods are now within the capabilities of consumer-level processing systems, their adoption is hindered by the lack of contents. Production and coding of the audio components in multimedia focus on traditional formats such as stereophonic sound. Mainstream audio codecs and media such as CDs or DVDs do not support advanced, rich contents such as HOA encodings. To ameliorate this problem and speed up the adoption of spatial sound technologies, this paper proposes a novel way to downmix HOA contents into a stereo signal. The resulting data can be distributed using conventional methods such as audio CDs or as the audio component of an internet video stream. The results can be listened to using legacy stereo reproduction systems. However, they include spatial information encoded as the inter-channel level and phase differences. The proposed method consists of a downmixing filterbank which independently modulate inter-channel differences at each frequency bin. The proposal is evaluated using simple test signals and found to outperform conventional methods such as matrix-encoded surround and the Ambisonics UHJ format in terms of spatial resolution. The proposal can be coupled with a previously presented method to recover HOA signals from stereo recordings. The resulting system allows for the preservation of full-surround spatial information in ultra-realistic contents when they are transferred using a stereo stream. Simulation results show that a compatible decoder can accurately recover up to five HOA channels from a stereo signal (2nd order HOA data in the horizontal plane).
机译:随着计算和信号处理技术的最新发展,多媒体内容的超现实呈现正在大力推动。三维声音演示是传达自然丰富的多媒体体验所必需的。实现这一目标的可行方法包括称为高阶Ambisonics(HOA)的声场再现技术。尽管这些高级方法现在处于消费者级处理系统的能力之内,但由于缺乏内容而阻碍了它们的采用。多媒体中音频组件的生产和编码着眼于传统格式,例如立体声。主流音频编解码器和媒体(例如CD或DVD)不支持高级,丰富的内容(例如HOA编码)。为了改善这个问题并加快空间声音技术的采用,本文提出了一种将HOA内容下混为立体声信号的新颖方法。可以使用常规方法(例如音频CD)或作为Internet视频流的音频组件来分发所得数据。可以使用传统的立体声再现系统收听结果。但是,它们包括编码为通道间电平和相位差的空间信息。所提出的方法包括一个降混滤波器组,该降混滤波器组独立地调制每个频率仓上的通道间差异。该提案使用简单的测试信号进行评估,发现在空间分辨率方面优于传统方法,例如矩阵编码的环绕声和Ambisonics UHJ格式。该提议可以与先前提出的方法结合以从立体声录音恢复HOA信号。当使用立体声流传输全环绕空间信息时,最终的系统可以保留超真实内容中的全环绕空间信息。仿真结果表明,兼容的解码器可以从立体声信号(水平面中的二阶HOA数据)中准确地恢复多达五个HOA通道。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号