首页> 外国专利> Defining a layout of text lines of CJK and non-CJK characters

Defining a layout of text lines of CJK and non-CJK characters

机译:定义CJK和非CJK字符的文本行的布局

摘要

A method is described for creating a scheme for dividing a text line of Chinese, Japanese or Korean (CJK) characters into character cells prior to applying classifiers and recognizing individual characters. Gaps between characters are found as a window is moved down the length of a text line. A histogram is built based on distances from the start of the window to a respective gap as the window is moved. The window is moved to the end of each gap after each gap is found and distances measured. This is repeated until the window reaches the end of the text line. A linear division graph (LDG) is constructed according to the detected gaps. Penalties for certain distances are applied. An optimum path is one with a minimal penalty sum and can be used as a scheme for dividing a text line into character cells.
机译:描述了一种用于创建方案的方法,该方案用于在应用分类器和识别单个字符之前将中文,日文或韩文(CJK)字符的文本行划分为字符单元。当窗口沿文本行的长度向下移动时,会发现字符之间的间隙。根据从窗口开始到窗口移动时各个间隙的距离建立直方图。找到每个间隙并测量距离后,窗口将移动到每个间隙的末端。重复此操作,直到窗口到达文本行的末尾。根据检测到的间隙构造线性分割图(LDG)。某些距离适用罚款。最佳路径是惩罚总和最小的路径,可以用作将文本行划分为字符单元的方案。

著录项

  • 公开/公告号US8559718B1

    专利类型

  • 公开/公告日2013-10-15

    原文格式PDF

  • 申请/专利权人 YURI CHULININ;

    申请/专利号US201213457968

  • 发明设计人 YURI CHULININ;

    申请日2012-04-27

  • 分类号G06K9/00;

  • 国家 US

  • 入库时间 2022-08-21 16:47:12

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号