Scene-Aware Audio for 360° Videos

Li Dingzeyu; Langlois Timothy R.; Zheng Changxi

首页> 外文期刊>ACM Transactions on Graphics >Scene-Aware Audio for 360° Videos

【24h】

Scene-Aware Audio for 360° Videos

机译：360°视频的场景感知音频

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Although 360 degrees cameras ease the capture of panoramic footage, it remains challenging to add realistic 360 degrees audio that blends into the captured scene and is synchronized with the camera motion. We present a method for adding scene-aware spatial audio to 360 degrees videos in typical indoor scenes, using only a conventional mono-channel microphone and a speaker. We observe that the late reverberation of a room's impulse response is usually diffuse spatially and directionally. Exploiting this fact, we propose a method that synthesizes the directional impulse response between any source and listening locations by combining a synthesized early reverberation part and a measured late reverberation tail. The early reverberation is simulated using a geometric acoustic simulation and then enhanced using a frequency modulation method to capture room resonances. The late reverberation is extracted from a recorded impulse response, with a carefully chosen time duration that separates out the late reverberation from the early reverberation. In our validations, we show that our synthesized spatial audio matches closely with recordings using ambisonic microphones. Lastly, we demonstrate the strength of our method in several applications.

机译：尽管360度摄像机简化了全景镜头的拍摄，但是添加逼真的360度音频以融合到捕获的场景中并与摄像机运动同步仍然是一项挑战。我们提出了一种仅使用传统的单声道麦克风和扬声器将场景感知的空间音频添加到典型室内场景中的360度视频中的方法。我们观察到，房间脉冲响应的后期混响通常在空间和方向上扩散。利用这一事实，我们提出了一种方法，该方法通过组合合成的早期混响部分和测得的后期混响尾音来合成任何源和收听位置之间的定向冲激响应。早期混响使用几何声学模拟进行模拟，然后使用调频方法进行增强以捕获室内共振。从记录的脉冲响应中提取后期混响，并精心选择的持续时间将后期混响与早期混响区分开。在我们的验证中，我们证明了合成的空间音频与使用立体声麦克风的录音非常匹配。最后，我们在几种应用中证明了我们方法的优势。

著录项

来源
《ACM Transactions on Graphics》 |2018年第4cd期|111.1-111.12|共12页
作者
Li Dingzeyu; Langlois Timothy R.; Zheng Changxi;
展开▼
作者单位

Columbia Univ, New York, NY 10027 USA;

Adobe Res, Seattle, WA USA;

Columbia Univ, New York, NY 10027 USA;

展开▼
收录信息美国《科学引文索引》(SCI);
原文格式 PDF
正文语种 eng
中图分类
关键词
360 degrees videos; ambisonic audio;

机译：360度视频;歧义音频;

相似文献

外文文献
中文文献
专利

1. Audio-visual object removal in 360-degree videos [J] . Shimamura Ryo, Feng Qi, Koyama Yuki, The Visual Computer . 2020,第10a12期

机译：360度视频中的视听对象删除
2. Hierarchical multimodal attention for end-to-end audio-visual scene-aware dialogue response generation [J] . Hung Le, Doyen Sahoo, Nancy F. Chen, Computer speech and language . 2020,第Sepa期

机译：端到端视听场景感知对话响应生成的分层多模式关注
3. Investigating topics, audio representations and attention for multimodal scene-aware dialog [J] . Shachi H. Kumar, Eda Okur, Saurav Sahay, Computer speech and language . 2020,第Nova期

机译：调查多模式场景感知对话框的主题，音频表示和注意力
4. End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features [C] . Chiori Hori, Huda Alamri, Jue Wang, IEEE International Conference on Acoustics, Speech and Signal Processing . 2019

机译：使用基于多模式注意力的视频功能的端到端视听场景感知对话框
5. Improving Quality of Experience for HTTP Adaptive Video Streaming: From Legacy to 360° Videos [D] . Taghavi Nasrabadi, Afshin. 2019

机译：提高HTTP自适应视频流的体验质量：从遗留到360°视频
6. A Simple and Effective Way to Study Executive Functions by Using 360° Videos [O] . Francesca Borgnis, Francesca Baglio, Elisa Pedroli, 2021

机译：使用360°视频研究执行功能的简单有效方法
7. End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features [O] . Chiori Hori, Huda Alamri, Jue Wang, 2019

机译：使用基于多模式关注的视频功能的端到端音频视觉场景感知对话框

Scene-Aware Audio for 360° Videos

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅