首页> 外文会议>Conference on storage and retrieval for image and video databases >Illumination-invariant video segmentation by hierarchical robust thresholding
【24h】

Illumination-invariant video segmentation by hierarchical robust thresholding

机译:通过分层强大的阈值平衡照明 - 不变视频分割

获取原文

摘要

Many methods for video segmentation rely upon the setting and tuning of thresholds for classifying interframe distances under various difference measures. An approach that has been used with some success has been to establish statistical measures for each new video and identify camera cuts as difference values far from the mean. For this type of strategy the mean and dispersion for some interframe distance measure must be calculated for each new video as a whole. Here we eliminate this statistical characterization step and at the same time allow for segmentation of streaming video by introducing a preprocessing step for illumination-invariance that concomitantly reduces input values to a uniform scale. The preprocessing step provides a solution to the problem that simple changes of illumination in a scene, such as an actor emerging from a shadow, can trigger a false positive transition, no matter whether intensity alone or chrominance is used in a distance measure. Our means of discounting lighting change for color constancy consists of the simple yet effective operation of normalizing each color channel to length 1 (when viewed as a long, length-N vector). We then reduce the dimensionality of color to two-dimensional chromaticity, with values which are in 0..1. Chromaticity histograms can be treated as images, and effectively low-pass filtered by wavelet-based reduction, followed by DCT and zonal coding. This results in an indexing scheme based on only 36 numbers, and lends itself to a binary search approach to transition detection. To this end we examine distributions for intra-clip and inter-clip distances separately, characterizing each using robust statistics, for temporal intervals from 32 frames to 1 frame by powers of 2. Then combining transition and non-transition distributions for each frame internal, we seek the valley between them, again robustly, for each threshold. Using the present method values of precision and recall are increased over previous methods. Moreover, illumination change produces very few false positives.
机译:许多用于视频分割的方法依赖于在各种差别测量下对分类帧间距离进行分类的阈值的设置和调整。已经成功使用的方法是为每个新视频建立统计措施,并将摄像机剪辑为差值远离平均值。对于这种类型的策略,必须为整个新视频计算一些帧间距离测量的均值和分散。在这里,我们消除了该统计表征步骤,同时允许通过引入用于照明 - 不变性的预处理步骤来分割流式视频,该不应达到将输入值降低到均匀规模。预处理步骤提供了一个解决方案,即在场景中的诸如从阴影中出现的演员的简单变化的问题,可以触发错误的阳性过渡,无论是否在距离测量中使用强度或色度。我们的折扣灯光变化的手段颜色恒定的变化包括将每个颜色通道的简单且有效的操作组成,每个颜色通道向长度1(当被视为长的长度N向量时)。然后,我们将颜色的维度降低到二维色度,其中值为0..1。色度直方图可以被视为图像,并有效地通过基于小波的减少滤波的低通,然后是DCT和Zonal编码。这导致仅基于36个数字的索引方案,并将其自身归因于转换检测的二进制搜索方法。为此,我们分别检查内部夹子和夹间距的分布,表征每个使用稳健的统计数据,以便通过2.然后将每个帧的转换和非转换分布组合到3帧的时间间隔。我们为每个门槛再次努力地寻求它们之间的山谷。使用本方法的精度和召回的值在先前的方法上增加。此外,照明变化产生极少的误报。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号