Towards Whole-Book Recognition

机译：走向全书识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe experimental results for unsupervised recognition of the textual contents of book-images using fully automatic mutual-entropy-based model adaptation. Each experiment starts with approximate {it iconic} and{it linguistic} models---derived from (generally errorful) OCR results and (generally incomplete) dictionaries---and then runs a fully automatic adaptation algorithm which, guided entirely by evidence internal to the test set, attempts to correct the models for improved accuracy. The iconic model describes image formation and determines the behavior of a character-image classifier. The linguistic model describes word-occurrence probabilities. Our adaptation algorithm detects disagreements between the models by analyzing mutual entropy between (1) the {em a posteriori} probability distribution of character classes (the recognition results from image classification alone), and (2) the {em a posteriori} probability distribution of word classes (the recognition results from image classification combined with linguistic constraints). Disagreements identify candidates for automatic model corrections. We report experiments on 40 textlines in which word error rates fall monotonicaly with passage lengths. We also report experiments on an enhanced algorithm which can cope with character-segmentation errors (a single split, or a single merge, per word). In order to scale up experiments, soon, to whole book images, we have revised data structures and implemented speed enhancements. For this algorithm, we report results on three increasingly long passage lengths: (a) one full page, (b) five pages, and (b) ten pages. We observe that error rates on long words fall monotonically with passage lengths.

机译：我们描述了基于全自动互熵的基于模型自适应的书本图像内容无监督识别的实验结果。每个实验都从近似的{it iconic}和{it语言学}模型开始-从（通常是错误的）OCR结果和（通常是不完整的）字典中提取模型-然后运行一个全自动的自适应算法，该算法完全由证据内部到测试集，尝试校正模型以提高准确性。图标模型描述图像的形成并确定字符图像分类器的行为。语言模型描述了单词出现的概率。我们的自适应算法通过分析（1）字符类的{后验概率}分布（仅来自图像分类的识别结果）与（2）字符集的后验概率分布之间的互熵来检测模型之间的分歧。单词类别（图像分类的识别结果结合语言限制）。分歧确定了自动模型更正的候选对象。我们报告了40条文本行的实验，其中单词错误率随着段落长度而单调下降。我们还报告了有关可解决字符分割错误（每个单词一个拆分或单个合并）的增强算法的实验。为了将实验规模扩大到整个书本图像，我们已经修改了数据结构并实施了速度增强功能。对于此算法，我们报告了三个越来越长的段落长度的结果：（a）一整页，（b）五页，和（b）十页。我们观察到长字的错误率随段落长度而单调下降。

著录项

来源
《Document Analysis Systems, DAS, 2008 Eighth IAPR Workshop on》||P.629-636|共8页
会议地点
作者
Xiu Pingping; Baird Henry S.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类工业技术;
关键词
adaptive classification; anytime algorithms; book recognition; document image recognition; isogeny; model adaptation; mutual entropy;

机译：自适应分类;随时算法;书本识别;文档图像识别;异构;模型自适应;互熵;

相似文献

外文文献
中文文献
专利

1. Whole-Book Recognition [J] . Xiu Pingping, Baird Henry S. Pattern Analysis and Machine Intelligence, IEEE Transactions on . 2012,第12期

机译：全书识别
2. Combating whole-book deterioration: the rebinding & mass deacidification program at the penn state university libraries [J] . L. Suzanne Kellerman Library Resources & Technical Services . 1999,第3期

机译：应对整本书恶化：宾夕法尼亚州立大学图书馆的重新绑定和大规模脱酸程序
3. Use of the recognition heuristic depends on the domain's recognition validity, not on the recognition validity of selected sets of objects [J] . Pohl Rudiger F., Michalkiewicz Martha, Erdfelder Edgar, Memory & cognition . 2017,第5期

机译：使用识别启发式依赖于域的识别有效性，而不是在所选对象集的识别有效性上
4. Clustering of Farsi Sub-word Images for Whole-book Recognition [C] . Mohammad Reza Soheili, Ehsanollah Kabir, Didier Stricker Document recognition and retrieval XXII . 2015

机译：波斯语子词图像聚类用于全书识别
5. Whole-book recognition. [D] . Xiu, Pingping. 2011

机译：全书识别。
6. Interaction of signal-recognition particle 54 GTPase domain and signal-recognition particle RNA in the free signal-recognition particle [O] . Tobias Hainzl, Shenghua Huang, A. Elisabeth Sauer-Eriksson 2007

机译：游离信号识别颗粒中信号识别颗粒54 GTPase结构域与信号识别颗粒RNA的相互作用
7. Incorporating Linguistic Post-Processing Into Whole-Book Recognition [O] . Pingping Xiu, Henry S. Baird 2011

机译：将语言后处理整合到全书识别中

Towards Whole-Book Recognition

摘要

著录项

相似文献

相关主题

期刊订阅