首页> 外文会议>International Conference on Document Analysis and Recognition >A New Parallel Detection-Recognition Approach for End-to-End Scene Text Extraction
【24h】

A New Parallel Detection-Recognition Approach for End-to-End Scene Text Extraction

机译:端到端场景文本提取的并行检测识别新方法

获取原文

摘要

In this work, we present a new conceptually simple and flexible network for accurate scene text extraction, which handle text detection and text recognition concurrently. Apart from solving feature sharing by designing a unified network, we implement end-to-end training of the whole system by a set of new optimization strategies. More importantly, this method highlights its novel parallel detection-recognition structure, which constructs a loose connection between both detection and recognition. This loose connection is embodied in the definition of overall loss and derivative back propagation for the model parameter updating, which automatically balances the contribution of the two branches to the system performance. It is different from the existing end-to-end methods where two subtasks are connected serially and thus yielding heavy dependence of the predecessor text detection task on the follow-up text recognition task and sensitivity of recognition to detection noise. In addition, a simple Mask-Rectifier mechanism is applied to easily adapt our system to incidental text recognition with arbitrary orientation and shape. Experiment results on Incidental Scene Text ICDAR2015 dataset surpass the current state-of-the-art FOTS method, as suggests the effectiveness of the proposed approach.
机译:在这项工作中,我们提出了一种用于精确场景文本提取的概念上简单而灵活的新网络,该网络可以同时处理文本检测和文本识别。除了通过设计统一的网络解决功能共享之外,我们还通过一系列新的优化策略对整个系统进行端到端培训。更重要的是,该方法突出了其新颖的并行检测-识别结构,该结构在检测和识别之间构建了松散的联系。这种松散的连接体现在用于模型参数更新的总损耗和导数反向传播的定义中,该定义自动平衡两个分支对系统性能的贡献。它与现有的端到端方法不同,在现有的端到端方法中,两个子任务被串行连接,因此,先前的文本检测任务严重依赖于后续的文本识别任务,并且对检测噪声的识别敏感度很高。此外,还应用了简单的Mask-Rectifier机制来使我们的系统轻松适应具有任意方向和形状的附带文本识别。偶然场景文本ICDAR2015数据集的实验结果超过了当前最新的FOTS方法,表明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号