首页> 外文会议>9th ACM/IEEE joint conference on digital libraries 2009 >Improving Optical Character Recognition through Efficient Multiple System Alignment
【24h】

Improving Optical Character Recognition through Efficient Multiple System Alignment

机译:通过高效的多系统对准改善光学字符识别

获取原文
获取原文并翻译 | 示例

摘要

Individual optical character recognition (OCR) engines vary in the types of errors they commit in recognizing text, particularly poor quality text. By aligning the output of multiple OCR engines and taking advantage of the differences between them, the error rate based on the aligned lattice of recognized words is significantly lower than the individual OCR word error rates. This lattice error rate constitutes a lower bound among aligned alternatives from the OCR output. Results from a collection of poor quality mid-twentieth century typewritten documents demonstrate an average reduction of 55.0% in the error rate of the lattice of alternatives and a realized word error rate (WER) reduction of 35.8% in a dictionary-based selection process. As an important precursor, an innovative admissible heuristic for the A* algorithm is developed, which results in a significant reduction in state space exploration to identify all optimal alignments of the OCR text output, a necessary step toward the construction of the word hypothesis lattice. On average 0.0079% of the state space is explored to identify all optimal alignments of the documents.
机译:各个光学字符识别(OCR)引擎在识别文本(尤其是质量较差的文本)时所犯的错误类型有所不同。通过对齐多个OCR引擎的输出并利用它们之间的差异,基于对齐的已识别单词晶格的错误率显着低于单个OCR单词错误率。此晶格错误率构成了OCR输出的对齐替代项的下限。收集自20世纪中叶质量不佳的打字文档的结果表明,在基于字典的选择过程中,替代格式的错误率平均降低了55.0%,实现的单词错误率(WER)降低了35.8%。作为重要的先驱,针对A *算法开发了一种创新的可允许启发式算法,它大大减少了状态空间探索以识别OCR文本输出的所有最佳对齐方式,这是构建单词假设格的必要步骤。平均探索状态空间的0.0079%,以识别文档的所有最佳对齐方式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号