...
首页> 外文期刊>IETE Technical Review >Review of Text Extraction Algorithms for Scene-text and Document Images
【24h】

Review of Text Extraction Algorithms for Scene-text and Document Images

机译:场景文本和文档图像的文本提取算法综述

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

One of the major applications of text retrieval from images is to extract the text information and then recognize its characters. This is helpful for indexing the images within storage media. When we want to search a particular image or document, there is no need to go through a large bunch of images. We go only through the group of indexed images, so that the task of finding the particular image becomes easy. Extracting text lines from scanned document images present a major problem in optical character recognition process as skewed text lines raise the complexity. The problem gets even worse with the text lines of different orientations. Such lines are called as multi-skewed lines. These multi-skewed lines are easily observed in both printed and handwritten documents. It is a challenging task to design a real time system, which can maintain a high recognition rate with good accuracy and is independent of the type of documents and character fonts. In this paper, we attempt to analyze and classify the various text extraction schemes for the scene-text and document images. We also compare different approaches of these images based on common problems and discuss their merits and demerits.
机译:从图像检索文本的主要应用之一是提取文本信息,然后识别其字符。这有助于索引存储介质中的图像。当我们要搜索特定的图像或文档时,无需浏览大量图像。我们仅遍历索引图像组,因此查找特定图像的任务变得容易。从被扫描的文档图像中提取文本行是光学字符识别过程中的一个主要问题,因为倾斜的文本行会增加复杂性。对于不同方向的文本行,问题变得更加严重。这样的线称为多偏斜线。在打印和手写文档中都容易观察到这些多角度的线条。设计一个实时系统是一项具有挑战性的任务,该系统可以保持较高的识别率并具有良好的准确性,并且与文档和字体的类型无关。在本文中,我们尝试对场景文本和文档图像的各种文本提取方案进行分析和分类。我们还根据常见问题比较了这些图像的不同方法,并讨论了它们的优缺点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号