首页> 外文期刊>IEEE Transactions on Circuits and Systems for Video Technology >Optimal Region Selection for Stereoscopic Video Subtitle Insertion
【24h】

Optimal Region Selection for Stereoscopic Video Subtitle Insertion

机译:立体视频字幕插入的最佳区域选择

获取原文
获取原文并翻译 | 示例

摘要

Stereoscopic subtitle insertion is a fundamental and essential element in stereoscopic film and TV industry. However, little work has been dedicated to the optimal region selection for stereoscopic subtitle insertion. In addition, there is no public database reported for the performance evaluation of it. In this paper, we build the first large-scale video database (TJU3D) for stereoscopic video subtitle insertion, which includes 50 video sequences with rich screen scenes. Compared with 2D subtitle region selection, there are several problems we have to consider in stereoscopic subtitle region selection: 1) the subtitle should avoid depth cue collision and occlusion from objects in stereoscopic video sequences; 2) the disparity value of the subtitle must be minimized to reduce visual discomfort; and 3) the temporal coherence constraint must be considered during region selection for subtitles in video sequences. By considering these constraints, we propose an optimal region selection algorithm for stereoscopic subtitle insertion. First, we compute the disparity map of each video frame in video sequences. For each frame, the optimal position and disparity value of the subtitle are determined by a subtitle region selection algorithm, which contains two parts (i.e., the coarse selection and fine selection). After that, by considering the temporal consistency between adjacent frames, the position and disparity value of each frame are further classified and processed in order to avoid the subtitle jitter. We evaluate the proposed method on TJU3D video database through two visual discomfort prediction metrics and one subjective experiment. To further verify the effectiveness of the proposed method, we also validate the performance of the proposed method on video comfort assessment database, i.e., IEEE-SA Stereo Database. Experimental results demonstrate that the visual discomfort is greatly reduced when using the proposed method compared with the basic method.
机译:立体字幕的插入是立体影视行业的基本要素。但是,很少有工作致力于立体字幕插入的最佳区域选择。此外,没有报告用于评估其性能的公共数据库。在本文中,我们建立了第一个用于立体视频字幕插入的大型视频数据库(TJU3D),其中包括50个具有丰富屏幕场景的视频序列。与2D字幕区域选择相比,在立体字幕区域选择中我们要考虑几个问题:1)字幕应避免立体视频序列中对象的深度提示碰撞和遮挡; 2)必须将字幕的视差值减到最小,以减少视觉不适感; 3)在视频序列中字幕的区域选择期间必须考虑时间一致性约束。通过考虑这些约束,我们提出了一种用于立体字幕插入的最佳区域选择算法。首先,我们计算视频序列中每个视频帧的视差图。对于每一帧,字幕的最佳位置和视差值由字幕区域选择算法确定,该算法包含两个部分(即粗略选择和精细选择)。之后,通过考虑相邻帧之间的时间一致性,进一步对每个帧的位置和视差值进行分类和处理以避免字幕抖动。通过两个视觉不适预测指标和一个主观实验,我们在TJU3D视频数据库上评估了该方法。为了进一步验证该方法的有效性,我们还在视频舒适度评估数据库(即IEEE-SA立体声数据库)上验证了该方法的性能。实验结果表明,与基本方法相比,该方法大大降低了视觉不适。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号