Deep Features for Text Spotting

机译：文字斑点的深层功能

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The goal of this work is text spotting in natural images. This is divided into two sequential tasks: detecting words regions in the image, and recognizing the words within these regions. We make the following contributions: first, we develop a Convolutional Neural Network (CNN) classifier that can be used for both tasks. The CNN has a novel architecture that enables efficient feature sharing (by using a number of layers in common) for text detection, character case-sensitive and insensitive classification, and bigram classification. It exceeds the state-of-the-art performance for all of these. Second, we make a number of technical changes over the traditional CNN architectures, including no downsampling for a per-pixel sliding window, and multi-mode learning with a mixture of linear models (maxout). Third, we have a method of automated data mining of Flickr, that generates word and character level annotations. Finally, these components are used together to form an end-to-end, state-of-the-art text spotting system. We evaluate the text-spotting system on two standard benchmarks, the ICDAR Robust Reading data set and the Street View Text data set, and demonstrate improvements over the state-of-the-art on multiple measures.

机译：这项工作的目标是在自然图像中发现文本。这分为两个连续的任务：检测图像中的单词区域，并识别这些区域中的单词。我们做出了以下贡献：首先，我们开发了可用于两个任务的卷积神经网络（CNN）分类器。 CNN具有新颖的体系结构，可实现有效的特征共享（通过使用多个公共层）以进行文本检测，区分大小写和不区分大小写的字符分类以及bigram分类。在所有这些方面，它都超过了最先进的性能。其次，我们对传统的CNN架构进行了许多技术更改，包括不对每个像素的滑动窗口进行下采样，以及混合使用线性模型（maxout）的多模式学习。第三，我们有一种Flickr的自动数据挖掘方法，该方法可以生成单词和字符级别的注释。最后，将这些组件一起使用以形成端到端的最新文本查找系统。我们以两个标准基准（ICDAR健壮读数数据集和街景文本数据集）评估文本发现系统，并展示了在多项措施方面的最新技术的改进。

著录项

来源
《European conference on computer vision》|2014年|512-528|共17页
会议地点
作者
Max Jaderberg; Andrea Vedaldi; Andrew Zisserman;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Word spotting and recognition via a joint deep embedding of image and text [J] . Mhiri Mohamed, Desrosiers Christian, Cheriet Mohamed Pattern Recognition: The Journal of the Pattern Recognition Society . 2019,第期

机译：通过联合嵌入图像和文本的联合深度嵌入单词
2. Text detection in natural images with hybrid stroke feature transform and high performance deep Convnet computing [J] . Vidhyalakshmi M., Sudha S. Concurrency, practice and experience . 2021,第3期

机译：具有混合行程特征变换和高性能深度ConvNet计算的自然图像中的文本检测
3. Analysis of Text Feature Extractors using Deep Learning on Fake News [J] . B.Ahmed, A.Hussain, A.Baseer, Engineering Technology and Applied Science Research . 2021,第2期

机译：用深入学习对文本特征提取器的分析
4. Deep Features for Text Spotting [C] . Max Jaderberg, Andrea Vedaldi, Andrew Zisserman ECCV 2014 . 2014

机译：文本斑点的深度特征
5. Robust Text Spotting in Natural Images with Deep Neural Networks [D] . ?Yang, Xiao 2019

机译：具有深层神经网络的自然图像中的强大文本斑点
6. Classification of Biomedical Texts for Cardiovascular Diseases with Deep Neural Network Using a Weighted Feature Representation Method [O] . Nizar Ahmed, Fatih Dilmaç, Adil Alpkocak 2020

机译：使用加权特征表示方法对深神经网络的生物医学文本的分类
7. Bridging text spotting and SLAM with junction features [O] . Finn Chelsea, Kaess Michael, Teller Seth, 2015

机译：通过连接功能桥接文本定位和sLam

Deep Features for Text Spotting

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅