首页> 外文会议>Conference on storage and retrieval for image and video databases >Illumination-invariant video segmentation by hierarchical robust thresholding
【24h】

Illumination-invariant video segmentation by hierarchical robust thresholding

机译:通过分级鲁棒阈值进行照明不变视频分割

获取原文

摘要

Abstract: Many methods for video segmentation rely upon the setting and tuning of thresholds for classifying interframe distances under various difference measures. An approach that has been used with some success has been to establish statistical measures for each new video and identify camera cuts as difference values far from the mean. For this type of strategy the mean and dispersion for some interframe distance measure must be calculated for each new video as a whole. Here we eliminate this statistical characterization step and at the same time allow for segmentation of streaming video by introducing a preprocessing step for illumination-invariance that concomitantly reduces input values to a uniform scale. The preprocessing step provides a solution to the problem that simple changes of illumination in a scene, such as an actor emerging from a shadow, can trigger a false positive transition, no matter whether intensity alone or chrominance is used in a distance measure. Our means of discounting lighting change for color constancy consists of the simple yet effective operation of normalizing each color channel to length 1 (when viewed as a long, length-N vector). We then reduce the dimensionality of color to two-dimensional chromaticity, with values which are in 0..1. Chromaticity histograms can be treated as images, and effectively low-pass filtered by wavelet-based reduction, followed by DCT and zonal coding. This results in an indexing scheme based on only 36 numbers, and lends itself to a binary search approach to transition detection. To this end we examine distributions for intra-clip and inter-clip distances separately, characterizing each using robust statistics, for temporal intervals from 32 frames to 1 frame by powers of 2. Then combining transition and non-transition distributions for each frame internal, we seek the valley between them, again robustly, for each threshold. Using the present method values of precision and recall are increased over previous methods. Moreover, illumination change produces very few false positives. !15
机译:摘要:许多视频分割方法都依赖于阈值的设置和调整,以在各种差异措施下对帧间距离进行分类。已经成功使用的一种方法是为每个新视频建立统计量度,并将摄像机剪辑标识为远离均值的差值。对于这种类型的策略,必须为每个新视频整体计算一些帧间距离度量的均值和离散度。在这里,我们通过引入用于照明不变性的预处理步骤来消除流统计视频的这一统计表征步骤,同时允许将流视频分割,从而将输入值降低到统一的范围。预处理步骤为以下问题提供了解决方案:无论在距离度量中是单独使用强度还是使用色度,场景中照明的简单变化(例如从阴影中出现的演员)都可能触发假阳性过渡。我们为保持色彩恒定而减少照明变化的方法包括简单而有效的操作,将每个颜色通道归一化为长度1(当视为长的,长度为N的矢量时)。然后,我们将颜色的维数减少为二维色度,其值在0..1中。色度直方图可以被视为图像,并且可以通过基于小波的归约,随后的DCT和区域编码有效地进行低通滤波。这导致仅基于36个数字的索引方案,并使其自身适合用于过渡检测的二进制搜索方法。为此,我们分别检查了片段内和片段间距离的分布,使用稳健的统计数据表征了从32帧到1帧的时间间隔(以2的幂)的特征。然后结合内部每帧的过渡和非过渡分布,对于每个阈值,我们再次稳健地寻找它们之间的山谷。使用本方法,精度和查全率的值比以前的方法要高。此外,照度变化几乎不会产生误报。 !15

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号