Text and graphics segmentation of newspapers printed in Gurmukhi script: a hybrid approach

Abstract

Newspapers have long been a standard medium for conveying important information to large audiences. An automated system is needed to convert this information into a machine-processable form so that it becomes searchable. Much work has been done on typed or handwritten Gurmukhi script documents, but very few articles address text recognition, or text and image segmentation, for Gurmukhi script newspapers. Separating images and graphics from text is necessary before newspaper text is fed to OCR if the results are to be accurate. Many techniques for segmenting images and text have been proposed in the literature, but many of them are complex. In this article, the authors propose a simple and effective hybrid approach, based on the run length smoothing algorithm (RLSA) and projection profiles, to segment images from text in Gurmukhi script newspaper articles. Both horizontal and vertical run length smearing are used to label the regions. A logical AND operator is applied to the resulting images to identify the text and image regions. A projection profile technique is then used to segment the image region from among the labeled regions. The authors report that the combination of these two techniques produced very good results.
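
The pipeline described in the abstract (horizontal and vertical run length smearing, a logical AND of the two results, then projection profiles) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy image, the thresholds, and the rule that only white runs bounded by ink on both sides are filled are all assumptions made for illustration. Ink pixels are 1 and background pixels are 0.

```python
def smear_row(row, threshold):
    """Fill runs of 0s no longer than `threshold` that lie between two 1s.
    (Assumed variant: runs touching the border are left untouched.)"""
    out = list(row)
    last_ink = None  # index of the most recent ink pixel seen
    for i, pixel in enumerate(row):
        if pixel == 1:
            if last_ink is not None and 0 < i - last_ink - 1 <= threshold:
                for j in range(last_ink + 1, i):
                    out[j] = 1
            last_ink = i
    return out


def rlsa_horizontal(image, threshold):
    return [smear_row(row, threshold) for row in image]


def rlsa_vertical(image, threshold):
    # Transpose, smear each column as if it were a row, transpose back.
    smeared = [smear_row(col, threshold) for col in zip(*image)]
    return [list(row) for row in zip(*smeared)]


def logical_and(a, b):
    return [[x & y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def projection_profile(image, axis):
    """axis=0: ink count per row; axis=1: ink count per column."""
    if axis == 0:
        return [sum(row) for row in image]
    return [sum(col) for col in zip(*image)]


def split_on_gaps(profile):
    """Return (start, end) ranges, end exclusive, where the profile is non-zero."""
    blocks, start = [], None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i
        elif value == 0 and start is not None:
            blocks.append((start, i))
            start = None
    if start is not None:
        blocks.append((start, len(profile)))
    return blocks


# Hypothetical toy page: a sparse cluster (top left) and a dense block (bottom right).
image = [
    [0, 1, 0, 0, 0, 0, 0],
    [1, 0, 1, 0, 0, 0, 0],
    [0, 1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1, 1],
    [0, 0, 0, 0, 1, 1, 1],
]

merged = logical_and(rlsa_horizontal(image, 1), rlsa_vertical(image, 1))
row_blocks = split_on_gaps(projection_profile(merged, 0))
col_blocks = split_on_gaps(projection_profile(merged, 1))
print(row_blocks, col_blocks)
```

In this sketch the AND step keeps a smeared pixel only where both the horizontal and the vertical pass filled it, and the zero-valued gaps in the row and column profiles mark the boundaries between candidate regions. How the paper decides which labeled region is an image rather than text is not stated in the abstract, so that step is left out here.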

Bibliographic details

  • Source
    The Visual Computer | 2021, Issue 7 | pp. 1637-1659 | 23 pages
  • Author affiliations

    Guru Nanak Coll Dept Comp Applicat Muktsar Punjab India;

    Panjab Univ Reg Ctr Dept Comp Sci & Applicat Muktsar Punjab India;

    Maharaja Ranjit Singh Punjab Tech Univ Dept Computat Sci Bathinda Punjab India;

  • Indexing: Science Citation Index (SCI), USA; Engineering Index (EI), USA
  • Original format: PDF
  • Language: English
  • Chinese Library Classification
  • Keywords

    Segmentation; Gurmukhi; RLSA; Projection profiles;
