Combining Multi-scale Character Recognition and Linguistic Knowledge for Natural Scene Text OCR

机译：结合多尺度字符识别和语言知识自然场景文本OCR

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Understanding text captured in real-world scenes is a challenging problem in the field of visual pattern recognition and continues to generate a significant interest in the OCR (Optical Character Recognition) community. This paper proposes a novel method to recognize scene texts avoiding the conventional character segmentation step. The idea is to scan the text image with multi-scale windows and apply a robust recognition model, relying on a neural classification approach, to every window in order to recognize valid characters and identify non valid ones. Recognition results are represented as a graph model in order to determine the best sequence of characters. Some linguistic knowledge is also incorporated to remove errors due to recognition confusions. The designed method is evaluated on the ICDAR 2003 database of scene text images and outperforms state-of-the-art approaches.

机译：了解现实场景中捕获的文本是视野识别领域的一个具有挑战性的问题，并继续在OCR（光学字符识别）社区中产生重大兴趣。本文提出了一种识别场景文本避免传统字符分割步骤的新方法。该想法是用多尺度窗口扫描文本图像并应用一个鲁棒识别模型，依赖于神经分类方法，每个窗口才能识别有效字符并识别非有效字符。识别结果表示为图形模型，以便确定最佳字符序列。还包含一些语言知识，以消除由于识别混淆而导致的误差。在场景文本图像的ICDAR 2003数据库中评估了设计的方法，优于最先进的方法。

著录项

来源
《IAPR International Workshop on Document Analysis Systems》|2012年||共5页
会议地点
作者
Elagouni K.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391-53;
关键词

相似文献

外文文献
中文文献
专利

1. Scene Text Recognition Using Structure-Guided Character Detection and Linguistic Knowledge [J] . Shi C., Wang C., Xiao B., Circuits and Systems for Video Technology, IEEE Transactions on . 2014,第7期

机译：基于结构引导字符检测和语言知识的场景文本识别
2. MA-CRNN: a multi-scale attention CRNN for Chinese text line recognition in natural scenes [J] . Tong Guofeng, Li Yong, Gao Huashuai, International Journal on Document Analysis and Recognition . 2020,第2期

机译：MA-CRNN：在自然场景中的中国文本线路识别的多种关注CRNN
3. Natural scene text detection by multi-scale adaptive color clustering and non-text filtering [J] . Wu Hui, Zou Beiji, Zhao Yu-Qian, Neurocomputing . 2016,第nova19期

机译：通过多尺度自适应颜色聚类和非文本过滤进行自然场景文本检测
4. Combining Multi-scale Character Recognition and Linguistic Knowledge for Natural Scene Text OCR [C] . Elagouni K. Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on . 2012

机译：自然场景文本OCR的多尺度字符识别和语言知识的结合
5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques. [D] . Singh, Amriteshwar. 2011

机译：一种使用光学字符识别（OCR）和自动语音识别（ASR）技术的自动邮政地址识别系统的多模式融合方法。
6. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images [O] . Asghar Ali Chandio, Md. Asikuzzaman, Mark Pickering, 2020

机译：草书文本：用于自然场景图像中端到端乌尔都语文本识别的综合数据集
7. Combining Multi-Scale Character Recognition and Linguistic Knowledge for Natural Scene Text OCR [O] . Elagouni, Khaoula, Garcia, Christophe, Mamalet, Franck, 2012

机译：结合多尺度字符识别和语言知识的自然场景文本OCR
8. Optical Character Recognition (OCR) Inks. Category: Hardware Standard. Subcategory: Character Recognition [R] . Owen, R. K. 1980

机译：光学字符识别（OCR）油墨。类别：硬件标准。子类别：字符识别

Combining Multi-scale Character Recognition and Linguistic Knowledge for Natural Scene Text OCR

摘要

著录项

相似文献

相关主题

期刊订阅