首页> 外国专利> Automatic training of character templates using a text line image, a text line transcription and a line image source model

Automatic training of character templates using a text line image, a text line transcription and a line image source model

机译：使用文本行图像，文本行转录和行图像源模型自动训练字符模板

页面导航

摘要
著录项
相似文献

摘要

A technique for automatically producing, or training, a set of bitmapped character templates defined according to the sidebearing model of character image positioning uses as input a text line image of unsegmented characters, called glyphs, as the source of training samples. The training process also uses a transcription associated with the text line image, and an explicit, grammar-based text line image source model that describes the structural and functional features of a set of possible text line images that may be used as the source of training samples. The transcription may be a literal transcription of the line image, or it may be nonliteral, for example containing logical structure tags for document formatting and layout, such as found in markup languages. Spatial positioning information modeled by the text line image source model and the labels in the transcription are used to determine labeled image positions identifying the location of glyph samples occurring in the input line image, and the character templates are produced using the labeled image positions. In another aspect of the technique, a set of character templates defined by any character template model, such as a segmentation-based model, is produced using the grammar- based text line image source model and specifically using a tag transcription containing logical structure tags for document formatting and layout. Both aspects of the training technique may represent the text line image source model and the transcription as finite state networks.

机译：一种自动产生或训练根据字符图像定位的侧面轴承模型定义的位图字符模板的集合的技术，将未分段字符的文本行图像（称为字形）用作输入，作为训练样本的源。训练过程还使用与文本行图像关联的转录，以及基于显式，基于语法的文本行图像源模型，该模型描述了可能用作训练源的一组可能的文本行图像的结构和功能特征样品。转录可以是行图像的文字转录，也可以是非文字的，例如包含用于文档格式化和布局的逻辑结构标签，例如在标记语言中找到的标签。由文本行图像源模型和转录中的标签建模的空间定位信息用于确定标记的图像位置，该位置标识出现在输入行图像中的字形样本的位置，并且使用标记的图像位置生成字符模板。在该技术的另一方面，由任何字符模板模型（例如基于分段的模型）定义的一组字符模板是使用基于语法的文本行图像源模型，特别是使用包含用于以下内容的逻辑结构标签的标签转录生成的：文档格式和布局。训练技术的两个方面都可以将文本行图像源模型和转录表示为有限状态网络。

著录项

公开/公告号US5594809A

专利类型
公开/公告日1997-01-14

原文格式PDF
申请/专利权人 XEROX CORPORATION;
展开▼

申请/专利号US19950431253
发明设计人 PHILIP A. CHOU;LESLIE T. NILES;GARY E. KOPEC;
展开▼

申请日1995-04-28
分类号G06K9/62;
国家 US
入库时间 2022-08-22 03:10:45

相似文献

专利
外文文献
中文文献