Shirorekha extraction in Character Segmentation for printed devanagri text in Document Image Processing

Abstract

Structural layout analysis, text line segmentation, word-level segmentation, and character-level segmentation are major steps in offline OCR systems for Devanagari script in document image processing. This paper presents a complete word and character segmentation system for machine-printed Devanagari text. Interline spacing and fused characters can make line segmentation and character segmentation, respectively, difficult. We have tested our method on documents in Marathi. We also present a novel character segmentation technique for printed Devanagari text: after the Shirorekha (header line) is removed, bounding boxes are drawn around the segmented characters. The method applies basic morphological operations to the scanned document images, and the encouraging results obtained are attributed to these operations.
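
The abstract does not specify which morphological operations are used, so the following is only a minimal sketch of the general idea, under stated assumptions: the word image is already binarized (1 = ink), the header line is the only horizontal ink run of at least `k` pixels, and characters separate at ink-free columns once the header line is gone. The function names (`horizontal_opening`, `remove_header_line`, `character_boxes`), the threshold `k`, and the toy image `IMG` are hypothetical and do not come from the paper. The opening with a 1 x k horizontal structuring element is implemented directly as "keep horizontal runs of length at least k", which is equivalent for a binary image.

```python
def horizontal_opening(img, k):
    """Morphological opening with a 1 x k horizontal structuring element.

    Keeps only pixels that lie in a horizontal run of at least k ink pixels.
    """
    out = [[0] * len(row) for row in img]
    for r, row in enumerate(img):
        c = 0
        while c < len(row):
            if row[c]:
                start = c
                while c < len(row) and row[c]:
                    c += 1
                if c - start >= k:  # run is long enough to survive the opening
                    for j in range(start, c):
                        out[r][j] = 1
            else:
                c += 1
    return out


def remove_header_line(img, k):
    """Subtract the long horizontal strokes (assumed to be the Shirorekha)."""
    line = horizontal_opening(img, k)
    return [[1 if p and not q else 0 for p, q in zip(row, lrow)]
            for row, lrow in zip(img, line)]


def character_boxes(img):
    """Bounding boxes (top, left, bottom, right), inclusive, split at ink-free columns."""
    height = len(img)
    width = len(img[0]) if img else 0
    boxes = []
    c = 0
    while c < width:
        if any(img[r][c] for r in range(height)):
            left = c
            while c < width and any(img[r][c] for r in range(height)):
                c += 1
            right = c - 1
            rows = [r for r in range(height) if any(img[r][left:right + 1])]
            boxes.append((rows[0], left, rows[-1], right))
        else:
            c += 1
    return boxes


# Toy word image: a header line on row 0 joins two character bodies.
IMG = [[int(ch) for ch in row] for row in [
    "111111111",
    "010001110",
    "011001010",
    "010001110",
]]

print(character_boxes(IMG))                         # header line still present
print(character_boxes(remove_header_line(IMG, 5)))  # header line removed
```

With the header line in place, every column contains ink and the whole word comes out as a single box; after removal, the two character bodies separate. Real printed Devanagari also has vowel signs above and below the header line and fused characters, which this sketch does not handle.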

Bibliographic details

Source
Annual IEEE India Conference | 2014 | 7 pages
Authors
Shinde A.B.; Dandawate Y.H.
Original format: PDF
Chinese Library Classification: Radio electronics and telecommunications technology
Keywords
document image processing; feature extraction; image segmentation; text detection; Marathi scripts; Shirorekha extraction; bounding boxes; character segmentation system; header line; machine printed Devanagari text; morphological operations; word segmentation system; Image resolution; Optical imaging; Radio frequency; Character Segmentation; Devanagari Script; Line Segmentation; Structural Layout; Word Segmentation
