首页> 外国专利> Method for identifying word bounding boxes in text

Method for identifying word bounding boxes in text

机译:识别文字中的单词边界框的方法

摘要

A method for determining the boundaries of text or character strings represented in an array of image data by shape, without a requirement for individually detecting and/or identifying the character or characters making up the strings. The method relies upon the detection of connected components within words to first determine text line boundaries and to isolate the connected components into text rows. Subsequently, the structural relationships between the components within and defining rows (i.e. overlap, inter-character spacing, and inter-word spacing), are used to further combine adjacent sets of connected components into words or similar units of semantic understanding within text rows.
机译:一种用于通过形状确定图像数据阵列中表示的文本或字符串的边界的方法,而无需单独检测和/或识别组成字符串的一个或多个字符。该方法依靠对单词内的连接成分的检测来首先确定文本行边界,并将连接成分隔离为文本行。随后,使用行内和定义行之间的组件之间的结构关系(即重叠,字符间间隔和单词间间隔)来进一步将连接的组件的相邻集合组合成单词或文本行内的语义理解的相似单元。

著录项

  • 公开/公告号US5410611A

    专利类型

  • 公开/公告日1995-04-25

    原文格式PDF

  • 申请/专利权人 XEROX CORPORATION;

    申请/专利号US19930169949

  • 发明设计人 DANIEL P. HUTTENLOCHER;ERIC W. JAQUITH;

    申请日1993-12-17

  • 分类号G06K9/34;

  • 国家 US

  • 入库时间 2022-08-22 04:05:04

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号