首页> 外文会议>International Conference on Document Analysis and Recognition >Hypothesis Preservation Approach to Scene Text Recognition with Weighted Finite-State Transducer
【24h】

Hypothesis Preservation Approach to Scene Text Recognition with Weighted Finite-State Transducer

机译:对加权有限状态传感器的场景文本识别假设保存方法

获取原文

摘要

This paper shows that the use of Weighted Finite-State Transducer (WFST) significantly eliminates large-scale ambiguity in scene text recognition, especially for Japanese Kanji characters. The proposed method consists of two WFSTs called WFST-OCR and WFST-Lexicon. WFST-OCR handles the multiple hypotheses caused by erroneous text location, character segmentation and character recognition processes. The following WFST-Lexicon and its convolution of WFST-OCR resolve the hypotheses. The WFSTs integrate the conventional OCR and post-processing processes into one process. The benefit from the proposed method is that all the ambiguities are held as WFST data, and solved in one integrated step, the system outputs texts that are statistically consistent with regard to segmentation possibilities and the given language model. An experimental system demonstrates practical performance in spite of the hypothesis complexity inherent in the ICDAR test set and Kanji character texts.
机译:本文表明,使用加权有限状态换能器(WFST)显着消除了场景文本识别中的大规模模糊性,特别是对于日本汉字人物。该方法由两个名为WFST-OCR和WFST-Lexicon的WFST组成。 WFST-OCR处理由错误的文本位置,字符分段和字符识别过程引起的多个假设。以下WFST-Lexicon及其WFST-OCR的卷积解决了假设。 WFSTS将传统的OCR和后处理过程集成到一个过程中。从所提出的方法中的益处是所有的含糊不限作为WFST数据,并在一个集成步骤中解决,系统输出关于分割可能性和给定语言模型的统计上一致的文本。实验系统尽管ICDAR测试集和Kanji字符文本固有的假设复杂性,但实际情况表明了实际表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号