【24h】

End-to-End Measure for Text Recognition

机译:端到端的文本识别措施

获取原文

摘要

Measuring the performance of text recognition and text line detection engines is an important step to objectively compare systems and their configuration. There exist well-established measures for both tasks separately. However, there is no sophisticated evaluation scheme to measure the quality of a combined text line detection and text recognition system. The F-measure on word level is a well-known methodology, which is sometimes used in this context. Nevertheless, it does not take into account the alignment of hypothesis and ground truth text and can lead to deceptive results. Since users of automatic information retrieval pipelines in the context of text recognition are mainly interested in the end-to-end performance of a given system, there is a strong need for such a measure. Hence, we present a measure to evaluate the quality of an end-to-end text recognition system. The basis for this measure is the well established and widely used character error rate, which is limited - in its original form - to aligned hypothesis and ground truth texts. The proposed measure is flexible in a way that it can be configured to penalize different reading orders between the hypothesis and ground truth and can take into account the geometric position of the text lines. Additionally, it can ignore over-and under-segmentation of text lines. With these parameters it is possible to get a measure fitting best to its own needs.
机译:测量文本识别和文本行检测引擎的性能是客观比较系统及其配置的重要步骤。对于这两项任务,分别存在完善的措施。但是,没有复杂的评估方案来衡量组合的文本行检测和文本识别系统的质量。单词级别的F度量是一种众所周知的方法,有时在这种情况下使用。然而,它没有考虑到假设和事实真相的统一,并可能导致欺骗性结果。由于在文本识别上下文中自动信息检索管道的用户主要对给定系统的端到端性能感兴趣,因此非常需要这种措施。因此,我们提出了一种评估端到端文本识别系统质量的措施。这项措施的基础是公认的且广泛使用的字符错误率,该错误率以其原始形式仅限于一致的假设和基本事实文本。拟议的措施具有一定的灵活性,可以配置为惩罚假设和基本事实之间的不同阅读顺序,并且可以考虑文本行的几何位置。此外,它可以忽略文本行的过度分割和不足分割。有了这些参数,就有可能获得最适合其自身需求的量度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号