Journal: EURASIP Journal on Applied Signal Processing

Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation Algorithms

Abstract

Many image segmentation algorithms are known, but there is often an inherent obstacle to the unbiased evaluation of segmentation quality: the lack of a common objective representation for segmentation results. Such a representation, known as the ground truth, is a description of what one should obtain as the result of ideal segmentation, independently of the segmentation algorithm used. The creation of ground truth is a laborious process, and therefore any degree of automation is welcome. Document image analysis is one of the areas where ground truths are employed. In this paper, we describe an automated tool called GROTTO intended to generate ground truths for skewed document images, which can be used for the performance evaluation of page segmentation algorithms. Some of these algorithms are claimed to be insensitive to skew (tilt of text lines). However, this claim is usually supported only by a visual comparison of what one obtains and what one should obtain, since ground truths are mostly available for upright images, that is, those without skew. As a result, the evaluation is both subjective (that is, prone to errors) and tedious. Our tool allows users to quickly and easily produce many sufficiently accurate ground truths that can be employed in practice, and it therefore facilitates automatic performance evaluation. The main idea is to utilize the ground truths available for upright images and the concept of the representative square [9] in order to produce the ground truths for skewed images. The usefulness of our tool is demonstrated through a number of experiments with real document images of complex layout.
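The core idea of reusing upright ground truths for skewed images can be illustrated with a minimal sketch. It assumes that an upright ground-truth region is an axis-aligned rectangle and that the skewed image is the upright one rotated about a known center. The names `rotate_point` and `skew_region` are hypothetical, and the sketch does not implement the representative square [9] or GROTTO itself; it shows only the plain geometric transfer of region corners.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]
Box = Tuple[float, float, float, float]  # (left, top, right, bottom)


def rotate_point(p: Point, center: Point, angle_deg: float) -> Point:
    """Rotate p about center by angle_deg using the standard rotation matrix.

    In image coordinates (y pointing down), a positive angle appears
    clockwise on screen.
    """
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    dx, dy = p[0] - center[0], p[1] - center[1]
    return (center[0] + dx * cos_t - dy * sin_t,
            center[1] + dx * sin_t + dy * cos_t)


def skew_region(box: Box, center: Point, angle_deg: float) -> List[Point]:
    """Map an upright ground-truth rectangle to its four skewed corners.

    Assumption (not from the paper): the region is described by an
    axis-aligned box and the skew is a pure rotation about `center`.
    """
    left, top, right, bottom = box
    corners = [(left, top), (right, top), (right, bottom), (left, bottom)]
    return [rotate_point(c, center, angle_deg) for c in corners]


if __name__ == "__main__":
    # Hypothetical text block on a 1000 x 1400 page, skewed by 5 degrees.
    page_center = (500.0, 700.0)
    for x, y in skew_region((100, 200, 900, 400), page_center, 5.0):
        print(f"({x:.1f}, {y:.1f})")
```

Applying the same transform to every region of an upright ground truth yields a polygonal description for the skewed page, which could then be compared with the output of a segmentation algorithm run on the skewed image.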
