Combination of heterogeneous recognizer for image-based character recognition
Abstract
Approaches are provided for recognizing and locating text represented in image data. For example, image data that includes representations of text can be obtained. A width-focused recognition engine can be configured to analyze the image data to determine a base-set of words. The base-set of words can be associated with logical structure information that describes a geometric relationship between the words in the base-set. A set of bounding boxes that includes one or more base words can be determined, along with a confidence value for each base word. A depth-focused recognition engine can be configured to analyze the image data to determine a focused-set of words, which is likewise associated with a set of bounding boxes and a confidence value for each word. A set of merged words can be determined from a set of overlapping bounding boxes, that is, bounding boxes that overlap by at least a threshold amount. The set of merged words can include at least a portion of the base-set of words and/or the focused-set of words, and is selected based at least in part on the respective confidence values of the words in the set of overlapping bounding boxes. Thereafter, a final set of words that includes the set of merged words and appended words can be determined.
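The merge step described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the `Word` type, the `iou` and `combine_words` helpers, the use of intersection-over-union as the overlap measure, the default threshold of 0.5, and the reading of "appended words" as words whose bounding boxes have no sufficiently overlapping counterpart from the other engine.

```python
from dataclasses import dataclass
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1), hypothetical layout


@dataclass(frozen=True)
class Word:
    """Hypothetical recognizer output: text, bounding box, confidence."""
    text: str
    box: Box
    confidence: float


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes (assumed overlap measure)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def combine_words(base: List[Word], focused: List[Word],
                  threshold: float = 0.5) -> List[Word]:
    """Merge overlapping words by confidence, then append the unmatched ones."""
    merged: List[Word] = []
    appended: List[Word] = []
    used = set()  # indices of focused words already matched to a base word

    for b in base:
        overlapping = [
            (i, f) for i, f in enumerate(focused)
            if i not in used and iou(b.box, f.box) >= threshold
        ]
        if not overlapping:
            appended.append(b)  # no counterpart: keep the base word as-is
            continue
        used.update(i for i, _ in overlapping)
        candidates = [b] + [f for _, f in overlapping]
        # Keep the highest-confidence word among the overlapping boxes.
        merged.append(max(candidates, key=lambda w: w.confidence))

    # Focused words that overlapped no base word are appended as well.
    appended.extend(f for i, f in enumerate(focused) if i not in used)
    return merged + appended


base_words = [
    Word("hel1o", (0, 0, 10, 5), 0.60),
    Word("world", (20, 0, 30, 5), 0.90),
]
focused_words = [
    Word("hello", (0, 0, 10, 5), 0.95),
    Word("again", (40, 0, 50, 5), 0.70),
]
final_words = combine_words(base_words, focused_words)
print([w.text for w in final_words])
```

In this toy input the two engines disagree on the first word, so the higher-confidence reading is kept, while the two non-overlapping words pass through unchanged. A real system would also need to use the logical structure information mentioned in the abstract to order the final set, which this sketch does not attempt.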