首页> 外文会议>International Conference on Parallel, Distributed and Grid Computing >Zone Segmentation of a Text Line Printed in Gurmukhi Script Newspaper
【24h】

Zone Segmentation of a Text Line Printed in Gurmukhi Script Newspaper

机译:Gurmukhi脚本报纸上印刷的文本行的区域分割

获取原文

摘要

Newspapers consist of essential information and many efforts have been done for digitization and recognition of newspaper text. Few Gurumukhi script newspaper articles are available in digital form but searching of text is not possible on digital images. So, text processing is required for making searching possible on the text and also indexing of headlines. To recognize any text, segmentation of text into individual line is an important phase. As per characteristics of Gurmukhi script, text line can be divided into three zones namely upper zone, middle zone and lower zone. So, segmentation of an individual line into different zones is a preliminary phase for segmentation of text. Zone division is possible through detection of headline and baseline. Baseline detection is a tedious task because of the uneven presence of on and off pixels in a baseline. In this paper, authors have presented an algorithm for zone segmentation of Gurmukhi script newspaper text based on headline and baseline.
机译:报纸包含必不可少的信息,并且已经为报纸文本的数字化和识别做出了许多努力。很少有Gurumukhi脚本报纸的文章可以数字形式提供,但无法在数字图像上搜索文本。因此,需要进行文本处理才能在文本上进行搜索以及对标题进行索引。为了识别任何文本,将文本分割成单独的行是重要的阶段。根据古尔穆克语文字的特点,文本行可以分为三个区域,即上部区域,中间区域和下部区域。因此,将单个行分割为不同区域是文本分割的初步阶段。通过检测标题和基线可以进行区域划分。基线检测是一项繁琐的任务,因为基线中上下像素的不均匀存在。在本文中,作者提出了一种基于标题和基线的古尔穆希文字报纸文本区域分割算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号