首页> 外文OA文献 >Research on key technologies in multiview video and interactive multiview video streaming
【2h】

Research on key technologies in multiview video and interactive multiview video streaming

机译:多视点视频和交互式多视点视频流中的关键技术研究

摘要

Emerging video applications are being developed where multiple views of a scene are captured. Two central issues in the deployment of future multiview video (MVV) systems are compression efficiency and interactive video experience, which makes it necessary to develop advanced technologies on multiview video coding (MVC) and interactive multiview video streaming (IMVS). The former aims at efficient compression of all MVV data in a ratedistortion (RD) optimal manner by exploiting both temporal and inter-view redundancy, while the latter offers a viewer the ability to freely interact with MVV data, such that she can periodically request her desired viewpoint as the video is played back. Based on the observation that MVC and IMVS are fundamentally different MVV problems, in this thesis, we focus on developing different algorithms for practical MVC and IMVS designs.The first part of the thesis focuses on our research works on MVC. We first develop projective rectification-based view interpolation and extrapolation methods and apply them to MVC. Experimental results show that these schemes can achieve better RD performance than the current joint multiview video coding (JMVC) standard as well as view interpolation and extrapolation-based MVC schemes without using rectification. To explain the experimental results, we also develop mathematical models for the rectification-based view interpolation and extrapolation, from which we develop an improved theoretical model to compare the performances of various MVC schemes. Simulation results can verify the experimental results very well. In the second part of the thesis, we propose three major technological improvements to existing IMVS works to enhance its interactivity experience and implement it in a realistic network condition. First, in addition to camera-captured views, we make available additional virtual views between each pair of captured views for viewers’ selection, by transmitting both texture and depth maps of neighboring captured views and synthesizing intermediate views at decoder using depth-based image rendering (DIBR). Second, we construct a Markovian view-switching model that more accurately captures viewers’ behaviors. Third, we optimize frame structures and schedule the transmission of frames in a network-delay-cognizant manner, so that viewers can enjoy zero-delay view-switching even over transmission network with non-negligible network delay.
机译:正在开发捕捉场景的多个视图的新兴视频应用程序。未来的多视图视频(MVV)系统部署中的两个主要问题是压缩效率和交互式视频体验,这使得有必要开发有关多视图视频编码(MVC)和交互式多视图视频流(IMVS)的先进技术。前者旨在通过利用时间冗余和视图间冗余,以评级优化(RD)最佳方式有效压缩所有MVV数据,而后者则为观看者提供了与MVV数据自由交互的能力,因此她可以定期请求播放视频时所需的视点。在观察到MVC和IMVS是本质上不同的MVV问题的基础上,本文重点研究为实际MVC和IMVS设计开发不同的算法。本文的第一部分着重于我们对MVC的研究工作。我们首先开发基于投影校正的视图内插和外推方法,并将其应用于MVC。实验结果表明,与当前的联合多视图视频编码(JMVC)标准以及基于视图插值和基于外推的MVC方案相比,这些方案无需校正即可实现更好的RD性能。为了解释实验结果,我们还为基于整流的视图内插和外推开发了数学模型,从中我们开发了一种改进的理论模型来比较各种MVC方案的性能。仿真结果可以很好地验证实验结果。在论文的第二部分中,我们提出了对现有IMVS工作的三项主要技术改进,以增强其交互体验并在现实的网络条件下实现。首先,除了摄像机捕获的视图之外,我们还通过传输相邻捕获视图的纹理图和深度图并在解码器上使用基于深度的图像渲染合成中间视图,在每对捕获视图之间提供其他虚拟视图供观众选择(DIBR)。其次,我们构建了一个马尔可夫视图切换模型,可以更准确地捕获观众的行为。第三,我们优化帧结构并以网络延迟识别的方式安排帧的传输,使观看者即使在传输网络中网络延迟不可忽略的情况下也可以享受零延迟视图切换。

著录项

  • 作者

    Xiu Xiaoyu;

  • 作者单位
  • 年度 2011
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号