首页> 外文期刊>Expert systems with applications >A comprehensive comparison of end-to-end approaches for handwritten digit string recognition
【24h】

A comprehensive comparison of end-to-end approaches for handwritten digit string recognition

机译:关于手写数字字符串识别的结束方法的全面比较

获取原文
获取原文并翻译 | 示例
           

摘要

Over the last decades, most approaches proposed for handwritten digit string recognition (HDSR) have resorted to digit segmentation, which is dominated by heuristics, thereby imposing substantial constraints on the final performance. Few of them have been based on segmentation-free strategies where each pixel column has a potential cut location. Recently, segmentation-free strategies has added another perspective to the problem, leading to promising results. However, these strategies still show some limitations when dealing with a large number of touching digits. To bridge the resulting gap, in this paper, we hypothesize that a string of digits can be approached as a sequence of objects. We thus evaluate different end-to-end approaches to solve the HDSR problem, particularly in two verticals: those based on object-detection (e.g., Yolo and RetinaNet) and those based on sequence-to-sequence representation (CRNN).The main contribution of this work lies in its provision of a comprehensive comparison with a critical analysis of the above mentioned strategies on five benchmarks commonly used to assess HDSR, including the challenging Touching Pair dataset, NIST SD19, and two real-world datasets (CAR and CVL) proposed for the ICFHR 2014 competition on HDSR. Our results show that the Yolo model compares favorably against segmentation-free models with the advantage of having a shorter pipeline that minimizes the presence of heuristics-based models. It achieved a 97%, 96%, and 84% recognition rate on the NIST-SD19, CAR, and CVL datasets, respectively.
机译:在过去的几十年中,为手写数字字符串识别(HDSR)提出的大多数方法都采用了位于启发式的数字分割,从而对最终表现施加了大量的限制。其中很少有基于分段的策略,其中每个像素列具有潜在的切割位置。最近,分割策略已经增加了对问题的另一个观点,导致了有希望的结果。但是,在处理大量触摸的数字时,这些策略仍然显示了一些限制。为了弥合所产生的差距,在本文中,我们假设可以将一串数字串作为一系列物体接近。因此,我们评估不同的端到端方法来解决HDSR问题,特别是在两个垂直方面:基于对象检测(例如,YOLO和RETINALET)的那些,基于序列到序列表示(CRNN)。主要这项工作的贡献在于提供全面的比较,其与上述策略对常用于评估HDSR的五个基准的关键分析,包括挑战触摸对数据集,NIST SD19和两个现实世界数据集(汽车和CVL )为HDSR的ICFHR 2014年竞争提出。我们的研究结果表明,YOLO模型对分割模型进行了比较,其优点是具有更短的管道,最小化基于启发式的模型的存在。它分别在NIST-SD19,CAR和CVL数据集上达到了97%,96%和84%的识别率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号