首页> 外文OA文献 >Prediction architecture based on block matching statistics for mixed spatial-resolution multi-view video coding
【2h】

Prediction architecture based on block matching statistics for mixed spatial-resolution multi-view video coding

机译:基于块匹配统计的混合空间分辨率多视点视频编码预测架构

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The use of mixed spatial resolutions in multi-view video coding is a promising approach for coding videos efficiently at low bitrates. It can achieve a perceived quality, which is close to the view with the highest quality, according to the suppression theory of binocular vision. The aim of the work reported in this paper is to develop a new multi-view video coding technique suitable for low bitrate applications in terms of coding efficiency, computational and memory complexity, when coding videos, which contain either a single or multiple scenes. The paper proposes a new prediction architecture that addresses deficiencies of prediction architectures for multi-view video coding based on H.264/AVC. The prediction architectures which are used in mixed spatial-resolution multi-view video coding (MSR-MVC) are afflicted with significant computational complexity and require significant memory size, with regards to coding time and to the minimum number of reference frames. The architecture proposed herein is based on a set of investigations, which explore the effect of different inter-view prediction directions on the coding efficiency of multi-view video coding, conduct a comparative study of different decimation and interpolation methods, in addition to analyzing block matching statistics. The proposed prediction architecture has been integrated with an adaptive reference frame ordering algorithm, to provide an efficient coding solution for multi-view videos with hard scene changes. The paper includes a comparative performance assessment of the proposed architecture against an extended architecture based on the 3D digital multimedia broadcast (3D-DMB) and the Hierarchical B-Picture (HBP) architecture, which are two most widely used architectures for MSR-MVC. The assessment experiments show that the proposed architecture needs less bitrate by on average 13.1 Kbps, less coding time by 14% and less memory consumption by 31.6%, compared to a corresponding codec, which deploys the extended 3D-DMB architecture when coding single-scene videos. Furthermore, the codec, which deploys the proposed architecture, accelerates coding by on average 57% and requires 52% less memory, compared to a corresponding codec, which uses the HBP architecture. On the other hand, multi-view video coding which uses the proposed architecture needs more bitrate by on average 24.9 Kbps compared to a corresponding codec that uses the HBP architecture. For coding a multi-view video which has hard scene changes, the proposed architecture yields less bitrate (by on average 28.7 to 35.4 Kbps), and accelerates coding time (by on average 64 and 33%), compared to the HBP and extended 3D-DMB architectures, respectively. The proposed architecture will thus be most beneficial in low bitrate applications, which require multi-view video coding for video content depicting hard scene changes.
机译:在多视图视频编码中使用混合空间分辨率是一种以低比特率有效地编码视频的有前途的方法。根据双目视觉的抑制理论,它可以实现接近最高质量视图的感知质量。本文报道的工作目的是开发一种新的多视点视频编码技术,该技术在对包含单个或多个场景的视频进行编码时,在编码效率,计算和存储复杂性方面均适用于低比特率应用。本文提出了一种新的预测体系结构,该体系结构解决了基于H.264 / AVC的多视图视频编码的预测体系结构的不足。就编码时间和最小参考帧数而言,在混合空间分辨率多视图视频编码(MSR-MVC)中使用的预测体系结构会受到很大的计算复杂性的困扰,并且需要显着的内存大小。本文提出的架构基于一组调查,这些调查探索了不同视图间预测方向对多视图视频编码的编码效率的影响,除了分析块之外,还对不同抽取和内插方法进行了比较研究。匹配的统计信息。所提出的预测体系结构已与自适应参考帧排序算法集成在一起,从而为具有硬场景变化的多视点视频提供了一种有效的编码解决方案。本文包括对提议的体系结构与基于3D数字多媒体广播(3D-DMB)和Hierarchical B-Picture(HBP)体系结构的扩展体系结构的比较性能评估,这是MSR-MVC的两种最广泛使用的体系结构。评估实验表明,与相应的编解码器(在单场景编码时部署扩展的3D-DMB架构)相比,所建议的体系结构平均只需较少的比特率(平均13.1 Kbps),更少的编码时间和14%的内存消耗,以及31.6%的内存消耗。视频。此外,与使用HBP架构的相应编解码器相比,部署拟议架构的编解码器平均可将编码速度提高57%,所需内存减少52%。另一方面,与使用HBP体系结构的相应编解码器相比,使用所提出体系结构的多视图视频编码平均需要24.9 Kbps的更高比特率。与HBP和扩展的3D相比,对于编码场景变化较大的多视点视频,所建议的体系结构产生的比特率更低(平均28.7至35.4 Kbps),并且加快了编码时间(平均64%和33%)。 -DMB体系结构。因此,所提出的架构在低比特率应用中将是最有益的,该低比特率应用需要用于描述硬场景变化的视频内容的多视图视频编码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号