首页> 外文会议>International workshop on combinatorial image analysis >Character Segmentation of Hindi Unconstrained Handwritten Words
【24h】

Character Segmentation of Hindi Unconstrained Handwritten Words

机译:印地语无约束手写单词的字符分割

获取原文

摘要

The proper character level segmentation of printed or handwritten text is an important preprocessing step for optical character recognition (OCR). It is noticed that the languages having cursive nature in writing make the segmentation problem much more complicated. Hindi is one of the well known language in India having this cursive nature in writing style. The main challenge in handwritten character segmentation is to handle the inherent variability in the writing style of different individuals. In this paper, we present an efficient character segmentation method for handwritten Hindi words. Segmentation is performed on the basis of some structural patterns observed in the writing style of this language. The proposed method can cope with high variations in writing style and skewed header lines as input. The method has been tested on our own database for both printed and handwritten words. The average success rate is 96.93%. The method yields fairly good results for this database comparing with other existing methods. We foresee that the proposed character segmenattion technique can be used as a part of an OCR system for cursive handwritten Hindi language.
机译:打印或手写文本的正确字符级别分段是光学字符识别(OCR)的重要预处理步骤。值得注意的是,具有草书性质的语言使分割问题变得更加复杂。印地语是印度最有名的语言之一,具有这种草书本性。手写字符分割的主要挑战是处理不同个人的写作风格固有的可变性。在本文中,我们提出了一种有效的手写北印度语单词字符分割方法。分割是根据以这种语言的写作风格观察到的一些结构模式进行的。所提出的方法可以应对书写风格的高变化和倾斜的标题行作为输入。该方法已在我们自己的数据库中针对印刷文字和手写文字进行了测试。平均成功率为96.93%。与其他现有方法相比,该方法对于此数据库产生相当不错的结果。我们预见到,提出的字符分割技术可以用作草书手写印地语的OCR系统的一部分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号