首页> 外文会议>Annual document recognition and retrieval conference >The Inference Process of Programmed Attributed Regular Grammars for Character Recognition
【24h】

The Inference Process of Programmed Attributed Regular Grammars for Character Recognition

机译:用于字符识别的编程归属常规语法的推理过程

获取原文
获取外文期刊封面目录资料

摘要

The paper presents the grammar inference engine of a pattern recognition system for character recognition. The input characters are identified, thinned to a one pixel width pattern and a feature-based description is provided. Using the syntactic recognition paradigm, the features are the set of terminals (or terminal symbols) for the application. The feature-based description includes a set of three attributes (i.e. A, B, C) for each feature. The combined feature and attribute description for each input pattern preserves in a more accurate way the structure of the original pattern. The grammar inference engine uses the feature-based description of each input pattern from the training set to build a grammar for each class of patterns. For each input pattern from the training set, the productions (rewriting rules) are derived together with all the necessary elements such as: the nonterminals, branch and testing conditions. Since the grammars are regular, the process of deriving the production rules is simple. All the productions are collected together providing the tags to be consecutive, without gaps. The size of the class grammars is reduced at an acceptable level for further processing using a set of Evans heuristic rules. These algorithms identifies the redundant productions, eliminating those productions and the correspondent nonterminal symbols. The stop criteria for the Evans thinning algorithm makes sure that no further reductions are possible. The last step of the grammar inference process enables the grammar to identify class members which were not in the training set: a cycling production rule. The above built grammars are used by the syntactic (character) classifier to identify the input patterns as being members of a-priori known classes.
机译:本文介绍了用于字符识别的模式识别系统的语法推理引擎。识别输入字符,减薄到一个像素宽度模式,并且提供了基于特征的描述。使用语法识别范例,特征是应用程序的终端(或终端符号)集。基于特征的描述包括每个特征的一组三个属性(即a,b,c)。每个输入模式的组合特征和属性描述以更准确的方式保留原始模式的结构。语法推理引擎使用来自训练集的每个输入模式的基于特征的描述,为每类模式构建语法。对于来自训练集的每个输入模式,产品(重写规则)与所有必要的元素一起导出,例如:非终端,分支和测试条件。由于语法是常规的,因此导出生产规则的过程很简单。所有制作都会收集在一起,使标签连续,没有间隙。在可接受的水平下,课堂语法的大小减少,以便使用一组埃文斯启发式规则进一步处理。这些算法识别冗余的制作,消除了这些制作和对应的非符号符号。 EVANS变薄算法的停止标准确保不再需要降低。语法推理过程的最后一步使语法能够识别不在培训集中的课程成员:骑自行车的生产规则。句法(字符)分类器使用上面的构建语法,以将输入模式标识为AS-Priori已知类的成员。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号