...
首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >ASTER: An Attentional Scene Text Recognizer with Flexible Rectification
【24h】

ASTER: An Attentional Scene Text Recognizer with Flexible Rectification

机译:ASTER:具有灵活纠正功能的注意力场景文本识别器

获取原文
获取原文并翻译 | 示例
           

摘要

A challenging aspect of scene text recognition is to handle text with distortions or irregular layout. In particular, perspective text and curved text are common in natural scenes and are difficult to recognize. In this work, we introduce ASTER, an end-to-end neural network model that comprises a rectification network and a recognition network. The rectification network adaptively transforms an input image into a new one, rectifying the text in it. It is powered by a flexible Thin-Plate Spline transformation which handles a variety of text irregularities and is trained without human annotations. The recognition network is an attentional sequence-to-sequence model that predicts a character sequence directly from the rectified image. The whole model is trained end to end, requiring only images and their groundtruth text. Through extensive experiments, we verify the effectiveness of the rectification and demonstrate the state-of-the-art recognition performance of ASTER. Furthermore, we demonstrate that ASTER is a powerful component in end-to-end recognition systems, for its ability to enhance the detector.
机译:场景文本识别的一个具有挑战性的方面是处理具有扭曲或不规则布局的文本。特别是,透视文字和弯曲文字在自然场景中很常见,很难识别。在这项工作中,我们介绍了ASTER,这是一个由整流网络和识别网络组成的端到端神经网络模型。校正网络将输入图像自适应地转换为新图像,从而校正其中的文本。它由灵活的Thin-Plate Spline转换提供支持,该转换可处理各种文本不规则性,并且无需人工注释即可进行训练。识别网络是一个注意序列到序列模型,可以直接从校正后的图像中预测字符序列。整个模型是端到端训练的,仅需要图像及其真实文本。通过广泛的实验,我们验证了整流的有效性,并展示了ASTER的最新识别性能。此外,我们证明了ASTER能够增强检测器,因此是端到端识别系统中的强大组件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号