首页> 外国专利> METHOD FOR SEGMENTING TEXT WORDS IN DOCUMENT IMAGES USING VERTICAL PROJECTIONS OF CENTER ZONES OF CHARACTERS

METHOD FOR SEGMENTING TEXT WORDS IN DOCUMENT IMAGES USING VERTICAL PROJECTIONS OF CENTER ZONES OF CHARACTERS

机译:使用字符中心区域的垂​​直投影对文档图像中的文本词进行分割的方法

摘要

A word segmentation method for segmenting a text line into word segments, which is particularly advantageous for processing italic text but can also be used for regular text. A horizontal center zone of the text line, corresponding to the vertical center parts of the characters, is used to generate a center-zone-only vertical projection profile. The center zone is determined using a horizontal projection profile, by locating the two major peaks of that profile and defining the two major peak positions as the upper and lower boundaries of the center zone. Spacing segments (white gaps) in the vertical projection profile are identified, and classified into two classes, namely character spacing (gap between characters with a word) and word spacing (gap between words). The word spacings are used to segment the text line into word segments.
机译:一种用于将文本行分割成多个词段的分词方法,这对于处理斜体文本特别有利,但也可以用于常规文本。文本行的水平中心区域与字符的垂直中心部分相对应,用于生成仅中心区域的垂​​直投影轮廓。通过定位该轮廓的两个主峰并将两个主峰位置定义为中心区域的上下边界,可以使用水平投影轮廓确定中心区域。标识垂直投影轮廓中的间距段(白色间隙),并将其分为两类,即字符间距(带有单词的字符之间的间隙)和单词间距(单词之间的间隙)。单词间距用于将文本行分段为单词段。

著录项

  • 公开/公告号US2016180163A1

    专利类型

  • 公开/公告日2016-06-23

    原文格式PDF

  • 申请/专利权人 KONICA MINOLTA LABORATORY U.S.A. INC.;

    申请/专利号US201414578066

  • 发明设计人 WEI MING;

    申请日2014-12-19

  • 分类号G06K9;G06K9/46;

  • 国家 US

  • 入库时间 2022-08-21 14:36:05

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号