首页> 外文会议>Conference on real-time image and video processing >Deriving video content type from HEVC bitstream semantics
【24h】

Deriving video content type from HEVC bitstream semantics

机译:从HEVC比特流语义派生视频内容类型

获取原文

摘要

As network service providers seek to improve customer satisfaction and retention levels, they are increasingly moving from traditional quality of service (QoS) driven delivery models to customer-centred quality of experience (QoE) delivery models. QoS models only consider metrics derived from the network however, QoE models also consider metrics derived from within the video sequence itself. Various spatial and temporal characteristics of a video sequence have been proposed, both individually and in combination, to derive methods of classifying video content either on a continuous scale or as a set of discrete classes. QoE models can be divided into three broad categories, full reference, reduced reference and no-reference models. Due to the need to have the original video available at the client for comparison, full reference metrics are of limited practical value in adaptive real-time video applications. Reduced reference metrics often require metadata to be transmitted with the bitstream, while no-reference metrics typically operate in the decompressed domain at the client side and require significant processing to extract spatial and temporal features. This paper proposes a heuristic, no-reference approach to video content classification which is specific to HEVC encoded bitstreams. The HEVC encoder already makes use of spatial characteristics to determine partitioning of coding units and temporal characteristics to determine the splitting of prediction units. We derive a function which approximates the spatio-temporal characteristics of the video sequence by using the weighted averages of the depth at which the coding unit quadtree is split and the prediction mode decision made by the encoder to estimate spatial and temporal characteristics respectively. Since the video content type of a sequence is determined by using high level information parsed from the video stream, spatio-temporal characteristics are identified without the need for full decoding and can be used in a timely manner to aid decision making in QoE oriented adaptive real time streaming.
机译:随着网络服务提供商寻求提高客户满意度和保留水平,他们正日益从传统的服务质量(QoS)驱动的交付模型过渡到以客户为中心的体验质量(QoE)交付模型。 QoS模型仅考虑源自网络的指标,但是QoE模型也考虑源自视频序列本身的指标。已经提出了视频序列的各种空间和时间特性,无论是单独地还是组合地使用,以得出以连续尺度或一组离散类别对视频内容进行分类的方法。 QoE模型可以分为三大类:完全参考模型,简化参考模型和无参考模型。由于需要在客户端提供原始视频进行比较,因此完整的参考指标在自适应实时视频应用中的实用价值有限。减少的参考指标通常要求将元数据与比特流一起发送,而无参考指标通常在客户端的解压缩域中运行,并且需要大量处理以提取空间和时间特征。本文提出了一种启发式,无参考的视频内容分类方法,该方法专门针对HEVC编码的比特流。 HEVC编码器已经利用空间特性来确定编码单元的划分,并利用时间特性来确定预测单元的划分。我们使用分割编码单元四叉树的深度的加权平均值和编码器做出的预测模式决策来分别估算空间和时间特性,从而得出近似视频序列的时空特性的函数。由于序列的视频内容类型是通过使用从视频流中解析出的高级信息来确定的,因此可以识别时空特性,而无需完全解码,并且可以及时使用,以帮助面向面向QoE的自适应实境决策。时间流。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号