Combination of heterogeneous recognizer for image-based character recognition

Abstract

Approaches provide for recognizing and locating text represented in image data. For example, image data that includes representations of text can be obtained. A width-focused recognition engine can be configured to analyze the image data to determine a base-set of words. The base-set of words can be associated with logical structure information that describes a geometric relationship between words in the base-set of words. A set of bounding boxes that includes one or more base words can be determined, as well as a confidence value for each base word. A depth-focused recognition engine can be configured to analyze the image data to determine a focused-set of words, the focused-set of words associated with a set of bounding boxes and confidence values for respective words. A set of merged words can be determined from a set of overlapping bounding boxes that overlap a threshold amount. The set of merged words can include at least a portion of the base-set of words and/or the focused-set of words and are selected based at least in part on respective confidence values of words in the set of overlapping bounding boxes. Thereafter, a final set of words that includes the merged set of words and appended words can be determined.
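
The merging step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The following are assumptions made for illustration only: the `Word` record, the `iou` and `combine_words` helpers, the use of intersection-over-union as the overlap measure, the 0.5 threshold, and the reading of "appended words" as words whose bounding boxes do not overlap any word from the other engine. The abstract only states that overlapping boxes are compared and that words are selected by confidence.

```python
from dataclasses import dataclass
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


@dataclass(frozen=True)
class Word:
    """Hypothetical record for one recognized word."""
    text: str
    box: Box
    confidence: float


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two boxes (assumed overlap measure)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def combine_words(base_words: List[Word],
                  focused_words: List[Word],
                  overlap_threshold: float = 0.5) -> List[Word]:
    """Merge the outputs of two recognizers.

    For each base word, find the best-overlapping unused focused word.
    If the overlap reaches the threshold, keep whichever word has the
    higher confidence. Words with no overlapping partner are kept as-is
    (assumed meaning of "appended words").
    """
    final_words: List[Word] = []
    used_focused = set()

    for base in base_words:
        best_index, best_overlap = None, overlap_threshold
        for i, focused in enumerate(focused_words):
            if i in used_focused:
                continue
            overlap = iou(base.box, focused.box)
            if overlap >= best_overlap:
                best_index, best_overlap = i, overlap
        if best_index is None:
            final_words.append(base)  # no overlapping partner
        else:
            used_focused.add(best_index)
            focused = focused_words[best_index]
            # Ties go to the base word; this tie-break is an assumption.
            final_words.append(
                base if base.confidence >= focused.confidence else focused
            )

    # Append focused words that matched no base word.
    final_words.extend(
        w for i, w in enumerate(focused_words) if i not in used_focused
    )
    return final_words
```

For example, if the base set contains "hello" (confidence 0.6) and "world" (0.9), and the focused set contains "hallo" (0.8) in the same box as "hello" plus a separate word "sale", the sketch keeps "hallo", "world", and "sale". A real system would also need to carry over the logical structure information (the geometric relationship between words) that the abstract associates with the base set; this sketch omits it.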

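Bibliographic details

  • Publication number: US10445569B1
  • Publication date: 2019-10-15
  • Original format: PDF
  • Applicant/Assignee: A9.COM INC.
  • Application number: US201615251832
  • Inventors: XIAOFAN LIN; SON DINH TRAN
  • Filing date: 2016-08-30
  • Classification: G06K9; G06T7/60; G06K9/52; G06K9/66; G06T7; G06K9/46; G06K9/62; G06F17/27
  • Country: US
  • Database entry time: 2022-08-21 12:16:42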
