首页> 外文会议>European conference on computer vision >Deep Features for Text Spotting
【24h】

Deep Features for Text Spotting

机译:文字斑点的深层功能

获取原文

摘要

The goal of this work is text spotting in natural images. This is divided into two sequential tasks: detecting words regions in the image, and recognizing the words within these regions. We make the following contributions: first, we develop a Convolutional Neural Network (CNN) classifier that can be used for both tasks. The CNN has a novel architecture that enables efficient feature sharing (by using a number of layers in common) for text detection, character case-sensitive and insensitive classification, and bigram classification. It exceeds the state-of-the-art performance for all of these. Second, we make a number of technical changes over the traditional CNN architectures, including no downsampling for a per-pixel sliding window, and multi-mode learning with a mixture of linear models (maxout). Third, we have a method of automated data mining of Flickr, that generates word and character level annotations. Finally, these components are used together to form an end-to-end, state-of-the-art text spotting system. We evaluate the text-spotting system on two standard benchmarks, the ICDAR Robust Reading data set and the Street View Text data set, and demonstrate improvements over the state-of-the-art on multiple measures.
机译:这项工作的目标是在自然图像中发现文本。这分为两个连续的任务:检测图像中的单词区域,并识别这些区域中的单词。我们做出了以下贡献:首先,我们开发了可用于两个任务的卷积神经网络(CNN)分类器。 CNN具有新颖的体系结构,可实现有效的特征共享(通过使用多个公共层)以进行文本检测,区分大小写和不区分大小写的字符分类以及bigram分类。在所有这些方面,它都超过了最先进的性能。其次,我们对传统的CNN架构进行了许多技术更改,包括不对每个像素的滑动窗口进行下采样,以及混合使用线性模型(maxout)的多模式学习。第三,我们有一种Flickr的自动数据挖掘方法,该方法可以生成单词和字符级别的注释。最后,将这些组件一起使用以形成端到端的最新文本查找系统。我们以两个标准基准(ICDAR健壮读数数据集和街景文本数据集)评估文本发现系统,并展示了在多项措施方面的最新技术的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号