Journal: Journal of visual communication & image representation

Fast perceptual region tracking with coding-depth sensitive access for stream transcoding


Abstract

Object-based bit allocation can significantly improve the perceptual quality of extremely compressed video. However, real-time video object detection in large-format, high-fidelity video is computationally daunting. Most algorithms begin with extensive use of classical bit analysis, and thus remain computationally heavy. Based on some recent results in human visual perception, in this paper we present an experimental visual region tracking algorithm designed specifically for perceptual stream transcoding. The algorithm exploits the cue order observed in human visual perception to achieve very high computation speed as well as tracking efficiency. Rather than starting from the pixel level, or using any pixel-level processing at all, it employs high-level motion cue and block shape cue analysis to identify, in the motion vector set, signatures of various relative movements between the object of interest, the scene background and the camera, and from there it identifies objects. It then uses predictive filters to track the regions. The result is a fast yet highly effective perceptual region tracking algorithm that can operate at stream rate and track regions of perceptually significant objects despite camera movements such as zoom, panning and translation. The technique is not specific to any special class of objects. We have implemented this algorithm in a live ISO-13818/MPEG-2 perceptual transcoder. In this paper, we report the performance of this implementation. This fast object-aware video rate transcoder is particularly suitable for live streaming and can convert a regular stream into a perceptually coded video stream.
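
The abstract does not give the actual motion signatures or the form of the predictive filters, so the following is only a minimal sketch of the general idea under stated assumptions. It takes the median motion vector as a crude stand-in for camera pan or translation (zoom is not modelled), flags blocks whose vectors deviate from it as a candidate object region, and smooths the region position with an alpha-beta filter, which is swapped in here as one simple example of a predictive filter. All function names, the class name, and the threshold and gain values are hypothetical.

```python
from statistics import median


def estimate_global_motion(mvs):
    """Median motion vector over all blocks: a crude estimate of camera pan.

    `mvs` is a 2D grid (list of rows) of (dx, dy) tuples, one per block.
    Assumption: most blocks belong to the background.
    """
    dxs = [dx for row in mvs for dx, _ in row]
    dys = [dy for row in mvs for _, dy in row]
    return median(dxs), median(dys)


def foreground_blocks(mvs, threshold=2.0):
    """Return (row, col) of blocks whose motion deviates from the global motion."""
    gx, gy = estimate_global_motion(mvs)
    return [
        (r, c)
        for r, row in enumerate(mvs)
        for c, (dx, dy) in enumerate(row)
        if abs(dx - gx) + abs(dy - gy) > threshold
    ]


def bounding_box(blocks):
    """Bounding box (top, left, bottom, right) in block units."""
    rows = [r for r, _ in blocks]
    cols = [c for _, c in blocks]
    return min(rows), min(cols), max(rows), max(cols)


class AlphaBetaTracker:
    """Alpha-beta filter on a 2D position (hypothetical gains)."""

    def __init__(self, position, alpha=0.5, beta=0.1):
        self.pos = position
        self.vel = (0.0, 0.0)
        self.alpha = alpha
        self.beta = beta

    def update(self, measured):
        # Predict one frame ahead, then correct with the measurement residual.
        pred = (self.pos[0] + self.vel[0], self.pos[1] + self.vel[1])
        res = (measured[0] - pred[0], measured[1] - pred[1])
        self.pos = (pred[0] + self.alpha * res[0], pred[1] + self.alpha * res[1])
        self.vel = (self.vel[0] + self.beta * res[0], self.vel[1] + self.beta * res[1])
        return self.pos


# Synthetic 4x4 field: camera pan of (3, 0) with two blocks moving differently.
field = [[(3, 0)] * 4 for _ in range(4)]
field[1][1] = (-2, 1)
field[1][2] = (-2, 1)
region = foreground_blocks(field)
box = bounding_box(region)
```

Working directly on a per-block vector grid keeps the cost proportional to the number of blocks rather than the number of pixels, which is the property the abstract emphasises.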
