首页> 外文期刊>International Journal of Computer Vision >Label Embedding: A Frugal Baseline for Text Recognition
【24h】

Label Embedding: A Frugal Baseline for Text Recognition

机译:标签嵌入:节俭的文本识别基准

获取原文
获取原文并翻译 | 示例
       

摘要

The standard approach to recognizing text in images consists in first classifying local image regions into candidate characters and then combining them with high-level word models such as conditional random fields. This paper explores a new paradigm that departs from this bottom-up view. We propose to embed word labels and word images into a common Euclidean space. Given a word image to be recognized, the text recognition problem is cast as one of retrieval: find the closest word label in this space. This common space is learned using the Structured SVM framework by enforcing matching label-image pairs to be closer than non-matching pairs. This method presents several advantages: it does not require ad-hoc or costly pre-/post-processing operations, it can build on top of any state-of-the-art image descriptor (Fisher vectors in our case), it allows for the recognition of never-seen-before words (zero-shot recognition) and the recognition process is simple and efficient, as it amounts to a nearest neighbor search. Experiments are performed on challenging datasets of license plates and scene text. The main conclusion of the paper is that with such a frugal approach it is possible to obtain results which are competitive with standard bottom-up approaches, thus establishing label embedding as an interesting and simple to compute baseline for text recognition.
机译:识别图像中文本的标准方法包括:首先将局部图像区域分类为候选字符,然后将其与高级单词模型(例如条件随机字段)组合。本文探索了一种不同于这种自下而上的观点的新范式。我们建议将单词标签和单词图像嵌入到共同的欧几里得空间中。给定要识别的单词图像,将文本识别问题转换为检索之一:在该空间中找到最接近的单词标签。使用结构化SVM框架通过强制匹配的标签-图像对比不匹配的对更近来学习此公共空间。这种方法具有几个优点:它不需要临时或昂贵的前/后处理操作,它可以建立在任何最新的图像描述符(本例中为Fisher矢量)的基础上,它允许从未见过的单词的识别(零镜头识别)和识别过程简单而有效,因为它相当于最近的邻居搜索。实验是在具有挑战性的车牌和场景文本数据集上进行的。本文的主要结论是,通过这种节俭的方法,可以获得与标准的自下而上方法具有竞争力的结果,从而将标签嵌入确立为有趣且易于计算的文本识别基线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号