首页> 外文会议>International Conference on Pattern Recognition >R2 CNN: Rotational Region CNN for Arbitrarily-Oriented Scene Text Detection
【24h】

R2 CNN: Rotational Region CNN for Arbitrarily-Oriented Scene Text Detection

机译:R 2 CNN:旋转区域CNN,用于任意定向的场景文本检测

获取原文

摘要

Scene text detection is challenging as the input may have different orientations, sizes, font styles, lighting conditions, perspective distortions and languages. This paper addresses the problem by designing a Rotational Region CNN (R2CNN). R2CNN includes a Text Region Proposal Network (Text-RPN) to estimate approximate text regions and a multitask refinement network to get the precise inclined box. Our work has the following features. First, we use a novel multi-task regression method to support arbitrarily-oriented scene text detection. Second, we introduce multiple ROIPoolings to address the scene text detection problem for the first time. Third, we use an inclined Non-Maximum Suppression (NMS) to post-process the detection candidates. Experiments show that our method outperforms the state-of-the-art on standard benchmarks: ICDAR 2013, ICDAR 2015, COCO-Text and MSRA-TD500.
机译:场景文本检测具有挑战性,因为输入可能具有不同的方向,大小,字体样式,照明条件,透视图失真和语言。本文通过设计旋转区域CNN(R 2 CNN)。 [R 2 CNN包括一个用于估计近似文本区域的文本区域提议网络(Text-RPN)和一个用于获得精确倾斜框的多任务优化网络。我们的工作具有以下特点。首先,我们使用一种新颖的多任务回归方法来支持面向任意方向的场景文本检测。其次,我们首次引入了多个ROIPooling,以解决场景文本检测问题。第三,我们使用倾斜的非最大抑制量(NMS)对检测候选物进行后处理。实验表明,在标准基准测试中,我们的方法优于最新技术:ICDAR 2013,ICDAR 2015,COCO-Text和MSRA-TD500。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号