Source: International Journal of Engineering Intelligent Systems for Electrical Engineering and Co

A multiplicative multi-linear model for inter-camera prediction in free view 3D systems


Abstract

Recordings from multiple cameras in the context of 3D television are transmitted using either a simulcast encoding structure or a multi-view encoding structure. Multi-view Video Coding (MVC) extends the H.264/AVC standard and exploits the large amount of inter-view statistical dependency through combined temporal and inter-view prediction in 3D systems. We propose an alternative object-oriented video coding scheme for multi-view video that introduces an intermediate step: estimating highly correlated sets of image parameters per Group-Of-Pictures (GOP). We define a structure, called a Multi-view Video Plane (MVP), that holds the cross-correlation coefficients between views. The proposed method may also be applied to groups of orthonormal motion vector parameters corresponding to different views. No specific knowledge of the extrinsic or intrinsic parameters of the recording system is required. Object planes associated with a given view are approximated as multi-linear components of an image that are cross-correlated with object planes in other views in a tensor-like fashion. The order of the tensor equals the number of views. The correlation coefficients of the tensor subspace projections, as well as the updates of the multi-linear components, i.e. object-planes (MVPs) and sets of motion vector parameters (which we call Motion Prediction video-Objects, or MPOs), are quantized and transmitted in the MPEG stream. Residual object planes are encoded using conventional MPEG algorithms. Numerical results suggest an improvement over current encoding approaches, at the cost of a moderate increase in computational burden.
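As a rough illustration of the idea of cross-correlating views and coding only a residual, the sketch below computes pairwise correlation coefficients between flattened views and predicts one view from another with a least-squares gain and offset. It is not the paper's method: the abstract describes a tensor whose order equals the number of views, whereas this is a simplified second-order (pairwise) analogue. The function names `correlation`, `correlation_matrix` and `predict_from_reference`, and the toy pixel values, are hypothetical and chosen only for this example.

```python
from math import sqrt


def correlation(a, b):
    """Pearson correlation coefficient between two equally sized pixel lists."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a flat view carries no correlation information
    return cov / sqrt(var_a * var_b)


def correlation_matrix(views):
    """Pairwise correlation coefficients between all views (assumed layout)."""
    return [[correlation(u, v) for v in views] for u in views]


def predict_from_reference(reference, target):
    """Fit target ~ gain * reference + offset by least squares.

    Returns the gain, the offset and the per-pixel residual that a
    conventional encoder would still have to code.
    """
    n = len(reference)
    mean_r = sum(reference) / n
    mean_t = sum(target) / n
    var_r = sum((x - mean_r) ** 2 for x in reference)
    cov = sum((x - mean_r) * (y - mean_t) for x, y in zip(reference, target))
    gain = cov / var_r if var_r else 0.0
    offset = mean_t - gain * mean_r
    residual = [y - (gain * x + offset) for x, y in zip(reference, target)]
    return gain, offset, residual


# Toy data: three "views" of four pixels each (illustrative values only).
views = [
    [10, 20, 30, 40],
    [12, 22, 32, 42],
    [40, 30, 20, 10],
]
matrix = correlation_matrix(views)
gain, offset, residual = predict_from_reference(views[0], views[1])
```

When two views are strongly correlated, the residual is small, so only the gain, the offset and a low-energy residual need to be transmitted. That is the intuition behind the inter-view prediction summarized above.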
