Pattern Recognition: The Journal of the Pattern Recognition Society

Improved localization accuracy by LocNet for Faster R-CNN based text detection in natural scene images



Abstract

Although Faster R-CNN based text detection approaches have achieved promising results, their localization accuracy is not satisfactory in certain cases due to their sub-optimal bounding box regression based localization modules. In this paper, we address this problem and propose replacing the bounding box regression module with a novel LocNet based localization module to improve the localization accuracy of a Faster R-CNN based text detector. Given a proposal generated by a region proposal network (RPN), instead of directly predicting the bounding box coordinates of the concerned text instance, the proposal is enlarged to create a search region, and an "In-Out" conditional probability is assigned to each row and column of this search region, which can then be used to accurately infer the concerned bounding box. Furthermore, we present a simple yet effective two-stage approach that converts the difficult multi-oriented text detection problem into a relatively easier horizontal text detection problem, which enables our approach to robustly detect multi-oriented text instances with accurate bounding box localization. Experiments demonstrate that the proposed approach significantly boosts the localization accuracy of Faster R-CNN based text detectors. Consequently, our new text detector has achieved superior performance on both horizontal (ICDAR-2011, ICDAR-2013 and MULTILINGUAL) and multi-oriented (MSRA-TD500, ICDAR-2015) text detection benchmark tasks. (C) 2019 Elsevier Ltd. All rights reserved.
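To make the "In-Out" localization idea concrete, the sketch below shows one simple way a bounding box could be recovered from per-row and per-column In-Out probabilities over a search region. This is an illustrative assumption, not the authors' implementation: the function name and the 0.5-threshold decision rule are hypothetical (LocNet's actual inference maximizes a likelihood over candidate boxes), but the input/output shape matches the abstract's description.

```python
import numpy as np

def infer_box_from_inout(p_rows, p_cols, threshold=0.5):
    """Infer a bounding box (x1, y1, x2, y2), in search-region
    coordinates, from per-row and per-column "In-Out" probabilities.

    Rows/columns whose probability exceeds the threshold are treated
    as lying inside the text instance; the box is the tightest span
    covering them. Returns None if no row or column is confident.
    """
    rows_in = np.flatnonzero(np.asarray(p_rows) >= threshold)
    cols_in = np.flatnonzero(np.asarray(p_cols) >= threshold)
    if rows_in.size == 0 or cols_in.size == 0:
        return None  # no confident interior; reject this proposal
    y1, y2 = int(rows_in[0]), int(rows_in[-1])
    x1, x2 = int(cols_in[0]), int(cols_in[-1])
    return (x1, y1, x2, y2)

# Synthetic example: a 20-row x 30-column search region whose text
# interior spans rows 5..12 and columns 8..24.
p_rows = np.full(20, 0.1); p_rows[5:13] = 0.9
p_cols = np.full(30, 0.1); p_cols[8:25] = 0.9
print(infer_box_from_inout(p_rows, p_cols))  # (8, 5, 24, 12)
```

Because each row and column is scored independently, the predicted edges are not constrained by a fixed regression parameterization, which is the property the paper exploits to sharpen localization.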


