IEEE Transactions on Broadcasting

H2-Stereo: High-Speed, High-Resolution Stereoscopic Video System


Abstract

High-speed, high-resolution stereoscopic (H2-Stereo) video allows us to perceive dynamic 3D content at fine granularity. The acquisition of H2-Stereo video, however, remains challenging with commodity cameras. Existing spatial super-resolution or temporal frame interpolation methods provide compromised solutions that lack temporal or spatial details, respectively. To alleviate this problem, we propose a dual camera system, in which one camera captures high-spatial-resolution low-frame-rate (HSR-LFR) videos with rich spatial details, and the other captures low-spatial-resolution high-frame-rate (LSR-HFR) videos with smooth temporal details. We then devise a Learned Information Fusion network (LIFnet) that exploits the cross-camera redundancies to enhance both camera views to high spatiotemporal resolution (HSTR) for reconstructing the H2-Stereo video effectively. We utilize a disparity network to transfer spatiotemporal information across views even in large disparity scenes, based on which, we propose disparity-guided flow-based warping for LSR-HFR view and complementary warping for HSR-LFR view. A multi-scale fusion method in feature domain is proposed to minimize occlusion-induced warping ghosts and holes in HSR-LFR view. The LIFnet is trained in an end-to-end manner using our collected high-quality Stereo Video dataset from YouTube. Extensive experiments demonstrate that our model outperforms existing state-of-the-art methods for both views on synthetic data and camera-captured real data with large disparity. Ablation studies explore various aspects, including spatiotemporal resolution, camera baseline, camera desynchronization, long/short exposures and applications, of our system to fully understand its capability for potential applications.
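The disparity-guided flow-based warping described in the abstract can be illustrated with a short sketch. The following is not the authors' LIFnet code; it is a minimal PyTorch approximation assuming bilinear grid sampling, where `backward_warp` and `disparity_guided_warp` are hypothetical helper names, the disparity is treated as a purely horizontal displacement, and the two fields are composed by simple addition (a faithful composition would resample one field through the other).

```python
import torch
import torch.nn.functional as F

def backward_warp(src, flow):
    """Bilinearly sample `src` at locations displaced by `flow`.

    src:  (B, C, H, W) image or feature tensor from the reference view
    flow: (B, 2, H, W) per-pixel displacement in pixels (dx, dy)
    """
    b, _, h, w = src.shape
    # Base sampling grid in pixel coordinates.
    ys, xs = torch.meshgrid(
        torch.arange(h, device=src.device, dtype=src.dtype),
        torch.arange(w, device=src.device, dtype=src.dtype),
        indexing="ij",
    )
    grid_x = xs.unsqueeze(0) + flow[:, 0]  # displaced x coordinates
    grid_y = ys.unsqueeze(0) + flow[:, 1]  # displaced y coordinates
    # Normalize to [-1, 1] as required by grid_sample.
    grid = torch.stack(
        (2.0 * grid_x / (w - 1) - 1.0, 2.0 * grid_y / (h - 1) - 1.0), dim=-1
    )
    return F.grid_sample(src, grid, align_corners=True, padding_mode="border")

def disparity_guided_warp(hsr_frame, disparity, flow):
    """Transfer high-resolution detail across views and across time.

    For a rectified stereo pair the disparity is a purely horizontal
    displacement, so it is expressed as a flow field with zero vertical
    component and composed with the temporal optical flow. The sign of
    the disparity term depends on the left/right view convention.
    Additive composition is a simplification used for illustration.
    """
    disp_flow = torch.cat((-disparity, torch.zeros_like(disparity)), dim=1)
    return backward_warp(hsr_frame, disp_flow + flow)
```

In the full system, high-resolution content warped this way from the HSR-LFR view would then be fused with the upsampled LSR-HFR features at multiple scales in the feature domain, which is where the occlusion-induced ghosts and holes mentioned above are suppressed.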
