首页> 外文会议>International Workshop on Combinatorial Image Analysis >Character Segmentation of Hindi Unconstrained Handwritten Words
【24h】

Character Segmentation of Hindi Unconstrained Handwritten Words

机译:印地语不受约束手写词的字符分割

获取原文

摘要

The proper character level segmentation of printed or hand-written text is an important preprocessing step for optical character recognition (OCR). It is noticed that the languages having cursive nature in writing make the segmentation problem much more complicated. Hindi is one of the well known language in India having this cursive nature in writing style. The main challenge in handwritten character segmentation is to handle the inherent variability in the writing style of different individuals. In this paper, we present an efficient character segmentation method for handwritten Hindi words. Segmentation is performed on the basis of some structural patterns observed in the writing style of this language. The proposed method can cope with high variations in writing style and skewed header lines as input. The method has been tested on our own database for both printed and handwritten words. The average success rate is 96.93%. The method yields fairly good results for this database comparing with other existing methods. We foresee that the proposed character segmenattion technique can be used as a part of an OCR system for cursive handwritten Hindi language.
机译:打印或手写文本的合适字符级分割是用于光学字符识别(OCR)的重要预处理步骤。注意到具有写作中的草赋自然的语言使分段问题更加复杂。印地语是印度的知名语言之一,在写作风格中具有这种卷发性。手写字符分割中的主要挑战是处理不同个人的书写风格中的固有变异性。在本文中,我们提出了一种用于手写的印地文字的有效的字符分段方法。基于在这种语言的写作风格中观察到的某些结构模式的基础上进行分割。所提出的方法可以应对写入风格和偏斜线线的高变化作为输入。该方法已在我们自己的数据库上测试了印刷和手写单词。平均成功率为96.93%。该方法对与其他现有方法进行比较,该数据库产生了相当好的结果。我们预见到所提出的字符SegMenattion技术可以用作法式手写印地语语言的OCR系统的一部分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号