End-to-End Measure for Text Recognition

Abstract

Measuring the performance of text recognition and text line detection engines is an important step in objectively comparing systems and their configurations. Well-established measures exist for each task separately. However, there is no sophisticated evaluation scheme for measuring the quality of a combined text line detection and text recognition system. The word-level F-measure is a well-known methodology that is sometimes used in this context. Nevertheless, it does not take into account the alignment of hypothesis and ground truth text and can lead to deceptive results. Since users of automatic information retrieval pipelines in the context of text recognition are mainly interested in the end-to-end performance of a given system, there is a strong need for such a measure. Hence, we present a measure to evaluate the quality of an end-to-end text recognition system. The basis for this measure is the well-established and widely used character error rate, which in its original form is limited to aligned hypothesis and ground truth texts. The proposed measure is flexible in that it can be configured to penalize different reading orders between the hypothesis and ground truth and can take into account the geometric position of the text lines. Additionally, it can ignore over- and under-segmentation of text lines. With these parameters, the measure can be tailored to the needs of a given application.
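
As a rough illustration of the two baselines the abstract contrasts, the sketch below computes a plain character error rate (edit distance divided by ground truth length) and a bag-of-words F-measure on the same pair of texts. It is not the measure proposed in the paper: the function names `edit_distance`, `cer` and `word_f_measure` and the example strings are hypothetical and chosen only for illustration, and the reading-order, geometry and segmentation options described above are not modelled.

```python
from collections import Counter


def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between a ground truth and a hypothesis string."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # ref character left unmatched
                            curr[j - 1] + 1,      # hyp character left unmatched
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance normalised by ground truth length."""
    if not ref:
        raise ValueError("ground truth text must not be empty")
    return edit_distance(ref, hyp) / len(ref)


def word_f_measure(ref: str, hyp: str) -> float:
    """Bag-of-words F-measure; word order plays no role (illustrative only)."""
    ref_words, hyp_words = Counter(ref.split()), Counter(hyp.split())
    matched = sum((ref_words & hyp_words).values())  # multiset intersection
    if matched == 0:
        return 0.0
    precision = matched / sum(hyp_words.values())
    recall = matched / sum(ref_words.values())
    return 2 * precision * recall / (precision + recall)


reference = "the cat sat on the mat"
shuffled = "mat the on sat cat the"  # same words, different reading order

print(word_f_measure(reference, shuffled))  # order is invisible to this score
print(cer(reference, shuffled))             # character alignment is penalised
```

On this pair the word-level score is perfect although the text is out of order, while the character error rate is clearly non-zero. This is the kind of discrepancy the abstract refers to when it calls the word-level F-measure potentially deceptive.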

Bibliographic details

Source
International Conference on Document Analysis and Recognition | 2019 | pp. 1424-1431 | 8 pages
Authors
Gundram Leifert; Roger Labahn; Tobias Grüning; Svenja Leifert;
Original format: PDF
Keywords
Text recognition; Extraterrestrial measurements; Error analysis; Task analysis; Information retrieval; Text analysis; Computational intelligence;
Date indexed: 2022-08-26 14:34:51

Similar literature

1. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images [J]. Asghar Ali Chandio, Md. Asikuzzaman, Mark Pickering. Data in Brief. 2020, No. 3
2. An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition [J]. Baoguang Shi, Xiang Bai, Cong Yao. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2017, No. 11
3. End-to-end DNN based text-independent speaker recognition for long and short utterances [J]. Rohdin Johan, Silnova Anna, Diez Mireia. Computer Speech and Language. 2020, Issue Jana
4. End-to-End Measure for Text Recognition [C]. Gundram Leifert, Roger Labahn, Tobias Grüning. International Conference on Document Analysis and Recognition. 2019
5. End-to-end Approach for Gesture Recognition from 3D Data [D]. Owoyemi Toluwaleke Joshua. 2019
6. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images [O]. Asghar Ali Chandio, Md. Asikuzzaman, Mark Pickering. 2020
7. An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition [O]. Shi, Baoguang, Bai, Xiang, Yao, Cong. 2015
