首页> 外国专利> SYSTEM FOR OPTICAL CHARACTER RECOGNITION (OCR)

SYSTEM FOR OPTICAL CHARACTER RECOGNITION (OCR)

机译:光学字符识别系统(OCR)

摘要

A system (1) for optical character recognition (OCR) from an image containing text shall minimize transcription errors and improve the quality of the resulting machine-encoded text. To this end, the system comprises:- a text area dispatcher module (6), connected to a collection (8) of OCR engines (aTR1, aTR2,...aTRn), and configured to allocate said image as input to a plurality of said OCR engines (aTR1, aTR2,...aTRn) and receive machine-encoded texts as output from each of said plurality of said OCR engines (aTR1, aTR2,...aTRn), and- an OCR validation engine module (10), configured to receive said machine-encoded texts and assign a confidence level to each of said machine-encoded texts,wherein said text area dispatcher module (6) is further configured to receive said confidence levels from the OCR validation engine module (10), wherein said plurality of said OCR engines (aTR1, aTR2,...aTRn) is a proper subset of said collection (8) of OCR engines (aTR1, aTR2,...aTRn), and wherein said text area dispatcher module (6) is further configured to choose said plurality of said OCR engines (aTR1, aTR2,...aTRn) based on previously received confidence levels.
机译:用于从包含文本的图像中进行光学字符识别(OCR)的系统(1)可以最大程度地减少抄写错误,并提高生成的机器编码文本的质量。为此,该系统包括:-文本区域调度器模块(6),其连接到OCR引擎(aTR1,aTR2,... aTRn)的集合(8),并且被配置为将所述图像作为输入分配给多个所述OCR引擎(aTR1,aTR2) ,... aTRn)并接收机器编码的文本,作为来自所述多个所述OCR引擎(aTR1,aTR2,... aTRn)中的每一个的输出,以及-OCR验证引擎模块(10),被配置为接收所述机器编码文本并为每个所述机器编码文本分配置信度,其中,所述文本区域分派器模块(6)还被配置为从OCR验证引擎模块(10)接收所述置信度,其中,所述多个所述OCR引擎(aTR1,aTR2,... aTRn)是所述文本的适当子集。 OCR引擎(aTR1,aTR2,... aTRn)的集合(8),其中,所述文本区域分派器模块(6)进一步配置为基于多个所述OCR引擎(aTR1,aTR2,... aTRn)选择在以前收到的置信度上。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号