Improving Book OCR by Adaptive Language and Image Models

机译：通过自适应语言和图像模型改进书籍OCR

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In order to cope with the vast diversity of book content and typefaces, it is important for OCR systems to leverage the strong consistency within a book but adapt to variations across books. We describe a system that combines two parallel correction paths using document-specific image and language models. Each model adapts to shapes and vocabularies within a book to identify inconsistencies as correction hypotheses, but relies on the other for effective cross-validation. Using the open source Tesseract engine as baseline, results on a large data set of scanned books demonstrate that word error rates can be reduced by 25 percent using this approach.

机译：为了应对巨大的书籍内容和字体，对于OCR系统来说，重要的是利用书中的强烈一致性，而是适应书籍的变化。我们描述了一种使用特定于文档的图像和语言模型组合两个并行校正路径的系统。每个模型都适应书中的形状和词汇，以确定不一致的校正假设，但依赖于另一个用于有效的交叉验证。使用开源TESSERACT引擎作为基线，结果在大型数据集的扫描书籍上表明，使用这种方法可以减少25％的错误误差率。

著录项

来源
《IAPR International Workshop on Document Analysis Systems》|2012年||共5页
会议地点
作者
Dar-Shyang Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391-53;
关键词

相似文献

外文文献
中文文献
专利

1. Correcting Arabic OCR Errors Using Improved Topic-Based Language Models [J] . Safeya Mamish, Mohamed Cheriet International journal of computer processing of languages . 2009,第4期

机译：使用改进的基于主题的语言模型纠正阿拉伯语OCR错误
2. Script Segmentation of Printed Devnagari and Bangla Languages Document Images OCR [J] . International Journal of Computer Science and Technology . 2011,第2期

机译：印刷的天语和孟加拉语言文档图像OCR的脚本分割
3. OCR with the Deep CNN Model for Ligature Script-Based Languages like Manchu [J] . Diandian Zhang, Yan Liu, Zhuowei Wang, Scientific programming . 2021,第a期

机译：OCR与深入的CNN模型，即用于满族的结扎脚本语言
4. Improving Book OCR by Adaptive Language and Image Models [C] . Dar-Shyang Lee Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on . 2012

机译：通过自适应语言和图像模型改善图书OCR
5. A study of Peter Huchel: The cyclic structures in his books of poems, the theme of “sign” and “language”, and the images of ethnic minorities [D] . Sugiura, Kensuke 2004

机译：彼得·休歇尔研究：他的诗歌作品中的循环结构，“符号”和“语言”的主题以及少数民族的形象
6. Use of natural language processing to improve predictive models for imaging utilization in children presenting to the emergency department [O] . Xingyu Zhang, M. Fernanda Bellolio, Pau Medrano-Gracia, 2019

机译：利用自然语言处理来改善预测模型以便为急诊科的儿童提供影像学应用
7. Improving Book OCR by Adaptive Language and Image Models [O] . Dar-shyang Lee, Google Inc, Ray Smith 2013

机译：用自适应语言和图像模型改进书籍OCR

Improving Book OCR by Adaptive Language and Image Models

摘要

著录项

相似文献

相关主题

期刊订阅