首页> 外文期刊>Neurocomputing >Scene video text tracking based on hybrid deep text detection and layout constraint
【24h】

Scene video text tracking based on hybrid deep text detection and layout constraint

机译:基于混合深度文本检测和布局约束的场景视频文本跟踪

获取原文
获取原文并翻译 | 示例
           

摘要

Video text in real-world scenes often carries rich high-level semantic information and plays an ever-increasingly important role in the content-based video analysis and retrieval. Therefore, the scene video text detection and tracking are important prerequisites of numerous multimedia applications. However, the performance of most existing tracking methods is not satisfactory due to frequent mis-detections, unexpected camera motion and similar appearances between text regions. To address these problems, we propose a new video text tracking approach based on hybrid deep text detection and layout constraint. Firstly, a deep text detection network that combines the advantages of object detection and semantic segmentation in a hybrid way is proposed to locate possible text candidates in individual frames. Then, text trajectories are derived from consecutive frames with a novel data association method, which effectively exploits the layout constraint of text regions in large camera motion. By utilizing the layout constraint, the ambiguities caused by similar text regions are effectively reduced. We conduct experiments on four benchmark datasets, i.e., ICDAR 2015, MSRA-TD 500, USTB-SV1K and Minetto, to evaluate the proposed method. The experimental results demonstrate the effectiveness and superiority of the proposed approach. (C) 2019 Elsevier B.V. All rights reserved.
机译:真实场景中的视频文本通常包含丰富的高级语义信息,并且在基于内容的视频分析和检索中扮演着越来越重要的角色。因此,场景视频文本的检测和跟踪是众多多媒体应用的重要前提。但是,由于频繁的错误检测,意外的相机运动以及文本区域之间的相似外观,大多数现有跟踪方法的性能并不令人满意。为了解决这些问题,我们提出了一种新的基于混合深度文本检测和布局约束的视频文本跟踪方法。首先,提出了一种深度文本检测网络,该网络以混合方式结合了对象检测和语义分割的优点,可以在各个帧中定位可能的文本候选。然后,通过一种新颖的数据关联方法从连续的帧中得出文本轨迹,该方法有效地利用了大型摄像机运动中文本区域的布局约束。通过利用布局约束,有效地减少了由相似文本区域引起的歧义。我们在四个基准数据集上进行了实验,即ICDAR 2015,MSRA-TD 500,USTB-SV1K和Minetto,以评估该方法。实验结果证明了该方法的有效性和优越性。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号