首页> 外文会议>Annual IEEE India Conference >Shirorekha extraction in Character Segmentation for printed devanagri text in Document Image Processing
【24h】

Shirorekha extraction in Character Segmentation for printed devanagri text in Document Image Processing

机译:文档图像处理中印刷梵文文本的字符分割中的Shirorekha提取

获取原文

摘要

Finding Structural Layout, Text Line Segmentation, Word Level Segmentation and Character Level Segmentation is major step in offline OCR systems for Devanagari Script in Document Image Processing. This paper proposes a Word and Character Segmentation method for machine printed Devanagari text. A complete word and character segmentation system for Devanagari printed text is presented here. Sometimes, interline space and fused characters make line segmentation and character segmentation a difficult task respectively. We have tested our method on documents in Marathi scripts. A novel technique of character segmentation for printed Devanagari text is presented here. After removing the Shirorekha (header line) of Devanagari text, the bounding boxes are used to surround the segmented characters. Results obtained from this method are encouraging because of morphological operations. In this method we are proposing some basic morphological operations on the scanned document images and got much better results.
机译:查找结构布局,文本线段分割,字级分段和字符级分割是文档图像处理中Devanagari脚本的离线OCR系统的主要步骤。本文提出了一种机器打印的Devanagari文本的单词和字符分段方法。此处提出了一种用于Devanagari印刷文本的完整字和字符分段系统。有时,Interline空间和融合字符分别使线分割和字符分段分别成为困难的任务。我们已经在Marathi脚本中测试了我们的方法。在此提出了一种新的打印Devanagari文本的角色分割技术。删除Devanagari文本的Shirorekha(标题线)后,边界框用于围绕分段字符。从该方法获得的结果是由于形态学操作而令人鼓舞。在这种方法中,我们在扫描的文档图像上提出了一些基本的形态操作,并获得了更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号