首页> 外文会议>IAPR International Workshop on Document Analysis Systems >Combining Multi-scale Character Recognition and Linguistic Knowledge for Natural Scene Text OCR
【24h】

Combining Multi-scale Character Recognition and Linguistic Knowledge for Natural Scene Text OCR

机译:结合多尺度字符识别和语言知识自然场景文本OCR

获取原文

摘要

Understanding text captured in real-world scenes is a challenging problem in the field of visual pattern recognition and continues to generate a significant interest in the OCR (Optical Character Recognition) community. This paper proposes a novel method to recognize scene texts avoiding the conventional character segmentation step. The idea is to scan the text image with multi-scale windows and apply a robust recognition model, relying on a neural classification approach, to every window in order to recognize valid characters and identify non valid ones. Recognition results are represented as a graph model in order to determine the best sequence of characters. Some linguistic knowledge is also incorporated to remove errors due to recognition confusions. The designed method is evaluated on the ICDAR 2003 database of scene text images and outperforms state-of-the-art approaches.
机译:了解现实场景中捕获的文本是视野识别领域的一个具有挑战性的问题,并继续在OCR(光学字符识别)社区中产生重大兴趣。 本文提出了一种识别场景文本避免传统字符分割步骤的新方法。 该想法是用多尺度窗口扫描文本图像并应用一个鲁棒识别模型,依赖于神经分类方法,每个窗口才能识别有效字符并识别非有效字符。 识别结果表示为图形模型,以便确定最佳字符序列。 还包含一些语言知识,以消除由于识别混淆而导致的误差。 在场景文本图像的ICDAR 2003数据库中评估了设计的方法,优于最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号