首页>
外国专利>
Defining a layout of text lines of CJK and non-CJK characters
Defining a layout of text lines of CJK and non-CJK characters
展开▼
机译:定义CJK和非CJK字符的文本行的布局
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method is described for creating a scheme for dividing a text line of Chinese, Japanese or Korean (CJK) characters into character cells prior to applying classifiers and recognizing individual characters. Gaps between characters are found as a window is moved down the length of a text line. A histogram is built based on distances from the start of the window to a respective gap as the window is moved. The window is moved to the end of each gap after each gap is found and distances measured. This is repeated until the window reaches the end of the text line. A linear division graph (LDG) is constructed according to the detected gaps. Penalties for certain distances are applied. An optimum path is one with a minimal penalty sum and can be used as a scheme for dividing a text line into character cells.
展开▼