Hypothesis Preservation Approach to Scene Text Recognition with Weighted Finite-State Transducer

Abstract

This paper shows that using Weighted Finite-State Transducers (WFSTs) significantly reduces the large-scale ambiguity in scene text recognition, especially for Japanese Kanji characters. The proposed method consists of two WFSTs, called WFST-OCR and WFST-Lexicon. WFST-OCR holds the multiple hypotheses that arise from errors in text location, character segmentation, and character recognition. The subsequent WFST-Lexicon, through its convolution with WFST-OCR, resolves these hypotheses. Together, the WFSTs integrate conventional OCR and post-processing into a single process. The benefit of the proposed method is that all ambiguities are held as WFST data and resolved in one integrated step, so the system outputs text that is statistically consistent with both the segmentation possibilities and the given language model. An experimental system demonstrates practical performance despite the hypothesis complexity inherent in the ICDAR test set and in Kanji text.
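The idea of keeping every OCR hypothesis and resolving them against a lexicon in one step can be illustrated with a small sketch. It is not the authors' implementation: the arc costs, the two-word lexicon, and the names `build_trie` and `best_path` are all hypothetical, the lexicon is simplified to a trie acceptor rather than a full transducer, and costs are assumed to be negative log probabilities. The search walks pairs of (lattice node, lexicon node), which is the combination of the two machines that WFST toolkits usually call composition, and returns the cheapest word sequence.

```python
import heapq

def build_trie(lexicon):
    """Turn {word: cost} into a trie acceptor (hypothetical helper)."""
    children = [{}]   # children[node][char] -> child node id
    word_at = {}      # node id -> (word, cost) where a word ends
    for word, cost in lexicon.items():
        node = 0
        for ch in word:
            if ch not in children[node]:
                children.append({})
                children[node][ch] = len(children) - 1
            node = children[node][ch]
        word_at[node] = (word, cost)
    return children, word_at

def best_path(arcs, start, final, lexicon):
    """Cheapest word sequence through lattice x lexicon (Dijkstra search).

    arcs: (src, dst, char, cost) tuples; competing segmentations simply
    appear as alternative arcs between lattice nodes.
    """
    children, word_at = build_trie(lexicon)
    out = {}
    for src, dst, ch, cost in arcs:
        out.setdefault(src, []).append((dst, ch, cost))
    heap = [(0.0, start, 0, ())]  # (cost, lattice node, trie node, words)
    seen = set()
    while heap:
        cost, node, tnode, words = heapq.heappop(heap)
        if (node, tnode) in seen:
            continue
        seen.add((node, tnode))
        if node == final and tnode == 0 and words:
            return cost, list(words)
        if tnode in word_at:
            # Word boundary: pay the lexicon cost, return to the trie root.
            word, wcost = word_at[tnode]
            heapq.heappush(heap, (cost + wcost, node, 0, words + (word,)))
        for dst, ch, acost in out.get(node, []):
            nxt = children[tnode].get(ch)
            if nxt is not None:  # keep only arcs the lexicon can follow
                heapq.heappush(heap, (cost + acost, dst, nxt, words))
    return None

# Invented toy lattice: "rn" (nodes 2-3-4) competes with "m" (nodes 2-4).
ARCS = [(0, 1, "c", 0.1), (1, 2, "o", 0.2),
        (2, 3, "r", 0.6), (3, 4, "n", 0.5), (2, 4, "m", 0.4)]
LEXICON = {"corn": 0.5, "com": 2.0}  # invented language-model costs

if __name__ == "__main__":
    print(best_path(ARCS, 0, 4, LEXICON))
```

In this toy lattice the character costs alone favor "com" (0.7 against 1.4), but once the lexicon costs are added the search selects "corn" (about 1.9 against 2.7). The segmentation ambiguity is never decided early; it is carried in the lattice until the language model can weigh in.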

Bibliographic details

Source: International Conference on Document Analysis and Recognition, 2011, 5 pages
Authors: Yamazoe Takafumi; Etoh Minoru; Yoshimura Takeshi; Tsujino Kousuke
Original format: PDF
Chinese Library Classification: TP391.41-53
Keywords: Kanji character; WFST; character recognition; natural scene; scene text; text extraction
