首页> 外文会议>Document Recognition IV >Component-based handprint segmentation using adaptive writing style model
【24h】

Component-based handprint segmentation using adaptive writing style model

机译:基于自适应写作风格模型的基于组件的手印分割

获取原文

摘要

Abstract: Building upon the utility of connected components, NIST has designed a new character segmentor based on statistically modeling the style of a person's handwriting. Simple spatial features capture the characteristics of a particular writer's style of handprint, enabling the new method to maintain a traditional character-level segmentation philosophy without the integration of recognition or the use of oversegmentation and linguistic postprocessing. Estimates for stroke width and character height are used to compute aspect ratio and standard stroke count features that adapt to the writer's style at the field level. The new method has been developed with a predetermined set of fuzzy rules making the segmentor much less fragile and much more adaptive, and the new method successfully reconstructs fragmented characters as well as splits touching characters. The new segmentor was integrated into the NIST public domain form-based handprint recognition systems and then tested on a set of 490 handwriting sample forms found in NIST special database 19. When compared to a simple component-based segmentor, the new adaptable method improved the overall recognition of handprinted digits by 3.4 percent and field level recognition by 6.9 percent, while effectively reducing deletion errors by 82 percent. The same program code and set of parameters successfully segments sequences of uppercase and lowercase characters without any context-based tuning. While not as dramatic as digits, the recognition of uppercase and lowercase characters improved by 1.7 percent and 1.3 percent respectively. The segmentor maintains a relatively straight-forward and logical process flow avoiding convolutions of encoded exceptions as is common in expert systems. As a result, the new segmentor operates very efficiently, and throughput as high as 362 characters per second can be achieved. Letters and numbers are constructed from a predetermined configuration of a relatively small number of strokes. Results in this paper show that capitalizing on this knowledge through the use of simple adaptable features can significantly improve segmentation, whereas recognition-based and oversegmentation methods fail to take advantage of these intrinsic qualities of handprinted characters. !17
机译:摘要:NIST基于连接的组件的实用性,在对人的笔迹进行统计建模的基础上,设计了一种新的字符分割器。简单的空间特征捕获了特定作者手印风格的特征,从而使新方法能够保持传统的字符级分割哲学,而无需集成识别或使用过度分割和语言后处理。笔划宽度和字符高度的估计值用于计算纵横比和标准笔划计数功能,这些功能可在字段级别适应作者的风格。已经开发出具有预定的模糊规则集的新方法,该模糊规则使得分割器更不易碎,并且更具适应性,并且该新方法成功地重建了碎片字符以及分割了触摸字符。新的细分器已集成到基于NIST公共领域表单的手印识别系统中,然后在NIST专用数据库19中找到的490个手写样本表单集上进行了测试。与基于组件的简单细分器相比,新的自适应方法改进了手写数字的整体识别率达到3.4%,字段级别的识别率达到6.9%,同时有效地减少了82%的删除错误。相同的程序代码和参数集可以成功地分割大写和小写字符序列,而无需进行任何基于上下文的调整。尽管不如数字那么引人注目,但大写和小写字符的识别率分别提高了1.7%和1.3%。分段器保持相对简单和逻辑的处理流程,避免了专家系统中常见的编码异常卷积。结果,新的分割器非常高效地运行,并且可以实现高达每秒362个字符的吞吐量。字母和数字是由相对较少的笔划的预定配置构造而成的。本文的结果表明,通过使用简单的自适应功能来利用此知识可以显着改善分割效果,而基于识别的和过度分割的方法则无法利用这些手印字符的内在品质。 !17

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号