首页> 外文会议>Document Recognition III >Evaluation of document image skew estimation techniques
【24h】

Evaluation of document image skew estimation techniques

机译:评估文档图像偏斜估计技术

获取原文

摘要

Abstract: Recently there has been an increased interest in document image skew detection algorithms. Most of the papers relevant to this problem include some experimental results. However, there exists a lack of a universally accepted methodology for evaluating the performance of such algorithms. We have implemented four types of skew detection algorithms in order to investigate possible testing methodologies. We then tested each algorithm on a sample of 460 page images randomly selected from a collection of approximately 100,000 pages. This collection contains a wide variety of typographical features and styles. In our evaluation we examine several issues relevant to the establishment of a uniform testing methodology. First, we begin with a clear definition of the problem and the ground truth collection process. Then we examine the need for pre-processing and parameter optimization specific to each technique. Next, we investigate the problem of establishing meaningful statistical measurements of the performance of these algorithms and the use of non-parametric comparison methods to perform pairwise comparisons of methods. Lastly, we look at the sensitivity of each algorithm to particular typographical features, which indicates the need for the adoption of a stratified sampling paradigm for accurate analysis of performance. !14
机译:摘要:最近,人们对文档图像歪斜检测算法的兴趣日益浓厚。与该问题有关的大多数论文都包含一些实验结果。但是,缺乏评估这种算法的性能的普遍接受的方法。为了研究可能的测试方法,我们已经实现了四种类型的偏斜检测算法。然后,我们在从大约100,000页的集合中随机选择的460页图像的样本上测试了每种算法。该集合包含各种印刷功能和样式。在我们的评估中,我们研究了与建立统一测试方法有关的几个问题。首先,我们首先要对问题和地面真相收集过程进行清晰的定义。然后,我们研究了针对每种技术的预处理和参数优化的需求。接下来,我们研究对这些算法的性能建立有意义的统计度量以及使用非参数比较方法进行方法的成对比较的问题。最后,我们研究了每种算法对特定印刷特征的敏感性,这表明需要采用分层抽样范式来准确分析性能。 !14

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号