Pattern Recognition: The Journal of the Pattern Recognition Society

End-to-end scene text recognition using tree-structured models


Abstract

Detecting and recognizing text in natural images is challenging and has received much attention from the computer vision community in recent years. In this paper, we propose a robust end-to-end scene text recognition method that uses tree-structured character models and normalized pictorial structure word models. For each character category, we build a part-based tree-structured model (TSM) that exploits character-specific structure information as well as local appearance information. The TSM detects each part of a character and recognizes its unique structure, seamlessly combining character detection and recognition. Because TSMs can accurately detect characters against complex backgrounds, for text localization we apply the TSMs of all characters to the coarse text detection regions to eliminate false positives and to search for possibly missed characters. For word recognition, we propose a normalized pictorial structure (PS) framework to deal with the bias caused by words of different lengths. Experimental results on a range of challenging public datasets (ICDAR 2003, ICDAR 2011, SVT) demonstrate that the proposed method outperforms state-of-the-art methods for both text localization and word recognition.
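
The abstract does not state how the pictorial structure score is normalized, so the following is only a minimal sketch of the length bias it mentions, under a loud assumption: a word's score is taken to be the sum of its per-character detection scores, and normalization is taken to be division by the number of characters. The function names, the candidate words, and the score values are all hypothetical and are not from the paper.

```python
from typing import Callable, Dict, List


def raw_score(char_scores: List[float]) -> float:
    """Unnormalized word score: sum of per-character scores (assumed form)."""
    return sum(char_scores)


def normalized_score(char_scores: List[float]) -> float:
    """Length-normalized word score: mean per-character score (assumed form)."""
    if not char_scores:
        raise ValueError("a word needs at least one character score")
    return sum(char_scores) / len(char_scores)


def best_word(candidates: Dict[str, List[float]],
              scorer: Callable[[List[float]], float]) -> str:
    """Return the candidate word with the highest score under `scorer`."""
    return max(candidates, key=lambda word: scorer(candidates[word]))


# Hypothetical per-character scores for two lexicon words matched to one region.
candidates = {
    "art": [0.9, 0.8, 0.9],               # short word, strong characters
    "party": [0.6, 0.9, 0.8, 0.5, 0.4],   # longer word, weaker characters
}

# With positive scores, the raw sum favors the longer word simply because it
# has more terms; the per-character mean removes that advantage.
print(best_word(candidates, raw_score))
print(best_word(candidates, normalized_score))
```

With these made-up numbers the raw sum prefers "party" while the mean prefers "art". A full pictorial structure model would also include pairwise spatial terms between neighboring characters, which this sketch leaves out.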
