【24h】

Towards Whole-Book Recognition

机译:走向全书识别

获取原文

摘要

We describe experimental results for unsupervised recognition of the textual contents of book-images using fully automatic mutual-entropy-based model adaptation. Each experiment starts with approximate {it iconic} and{it linguistic} models---derived from (generally errorful) OCR results and (generally incomplete) dictionaries---and then runs a fully automatic adaptation algorithm which, guided entirely by evidence internal to the test set, attempts to correct the models for improved accuracy. The iconic model describes image formation and determines the behavior of a character-image classifier. The linguistic model describes word-occurrence probabilities. Our adaptation algorithm detects disagreements between the models by analyzing mutual entropy between (1) the {em a posteriori} probability distribution of character classes (the recognition results from image classification alone), and (2) the {em a posteriori} probability distribution of word classes (the recognition results from image classification combined with linguistic constraints). Disagreements identify candidates for automatic model corrections. We report experiments on 40 textlines in which word error rates fall monotonicaly with passage lengths. We also report experiments on an enhanced algorithm which can cope with character-segmentation errors (a single split, or a single merge, per word). In order to scale up experiments, soon, to whole book images, we have revised data structures and implemented speed enhancements. For this algorithm, we report results on three increasingly long passage lengths: (a) one full page, (b) five pages, and (b) ten pages. We observe that error rates on long words fall monotonically with passage lengths.
机译:我们描述了基于全自动互熵的基于模型自适应的书本图像内容无监督识别的实验结果。每个实验都从近似的{it iconic}和{it语言学}模型开始-从(通常是错误的)OCR结果和(通常是不完整的)字典中提取模型-然后运行一个全自动的自适应算法,该算法完全由证据内部到测试集,尝试校正模型以提高准确性。图标模型描述图像的形成并确定字符图像分类器的行为。语言模型描述了单词出现的概率。我们的自适应算法通过分析(1)字符类的{后验概率}分布(仅来自图像分类的识别结果)与(2)字符集的后验概率分布之间的互熵来检测模型之间的分歧。单词类别(图像分类的识别结果结合语言限制)。分歧确定了自动模型更正的候选对象。我们报告了40条文本行的实验,其中单词错误率随着段落长度而单调下降。我们还报告了有关可解决字符分割错误(每个单词一个拆分或单个合并)的增强算法的实验。为了将实验规模扩大到整个书本图像,我们已经修改了数据结构并实施了速度增强功能。对于此算法,我们报告了三个越来越长的段落长度的结果:(a)一整页,(b)五页,和(b)十页。我们观察到长字的错误率随段落长度而单调下降。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号