首页> 外文会议>Computer vision, graphics and image processing >A Text Recognition Augmented Deep Learning Approach for Logo Identification

【24h】

A Text Recognition Augmented Deep Learning Approach for Logo Identification

机译：文本识别增强深度学习方法在徽标识别中的应用

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Logo/brand name detection and recognition in unstructured and highly unpredictable natural images has always been a challenging problem. We notice that in most natural images logos are accompanied with associated text. Therefore, we address the problem of logo recognition by first detecting and isolating text of varying color, font size and orientation in the input image using affine invariant maximally stable extremal regions (MSERs). Using an off-the-shelf OCR, we identify the text associated with the logo image. Then an effective grouping technique is employed to combine the remaining stable regions based on spatial proximity of MSERs. Deep learning has the advantage that optimal features can be learned automatically from image pixel data. This motivates us to feed the clustered logo candidate image regions to a pre-trained deep convolutional neural network (DCNN) to generate a set of complex features which are further input to a multiclass support vector machine (SVM) for classification. We tested our proposed logo recognition system on 32 logo classes, and a non-logo class obtained by combining FlickrLogos-32 and MICC logo databases, amounting to a total of 23582 training and testing images. Our method yields robust recognition performance, outperforming state-of-the-art techniques achieving 97.8% precision, 95.7% recall and 95.7% average accuracy on the combined MICC and FlickrLogos-32 datasets and a precision of 98.6%, recall of 97.9% and average accuracy of 99.6% on only the FlickrLogos-32 dataset.

机译：在非结构化和高度不可预测的自然图像中检测徽标/商标名称一直是一个具有挑战性的问题。我们注意到，在大多数自然图像中，徽标都带有相关的文字。因此，我们通过使用仿射不变最大稳定极值区域（MSER）首先检测和隔离输入图像中颜色，字体大小和方向变化的文本来解决徽标识别的问题。使用现成的OCR，我们可以识别与徽标图像关联的文本。然后，基于MSER的空间接近度，采用有效的分组技术组合其余的稳定区域。深度学习的优势在于可以从图像像素数据中自动学习最佳功能。这促使我们将聚类的徽标候选图像区域输入到预先训练的深度卷积神经网络（DCNN）中，以生成一组复杂的特征，这些特征将进一步输入到多类支持向量机（SVM）中进行分类。我们在32个徽标类上测试了我们提出的徽标识别系统，并通过组合FlickrLogos-32和MICC徽标数据库获得了一个非徽标类，总共有23582个训练和测试图像。我们的方法具有强大的识别性能，在结合了MICC和FlickrLogos-32数据集的情况下，性能优于最新技术，达到97.8％的精度，95.7％的查全率和95.7％的平均准确率，以及98.6％的查准率，97.9％的查全率和仅FlickrLogos-32数据集的平均准确度为99.6％。

著录项

来源
《Computer vision, graphics and image processing 》|2016年|145-156|共12页
会议地点 Guwahati(IN)
作者
Moushumi Medhi; Shubham Sinha; Rajiv Ranjan Sahay;
展开▼
作者单位

Computational Vision Lab, Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India;

Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur 711103, West Bengal, India;

Department of Electrical Engineering, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Logo detection; Logo recognition; DCNN; MSER;

机译：标志检测；徽标识别； DCNN; SER;

相似文献

外文文献
中文文献
专利

1. Text Detection and Recognition for Images of Medical Laboratory Reports With a Deep Learning Approach [J] . Xue Wenyuan, Li Qingyong, Xue Qiyuan Quality Control, Transactions . 2020 ,第期

机译：深度学习方法医学实验室报告图像图像的文本检测与识别
2. GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text [J] . Zhu Qile, Li Xiaolin, Conesa Ana, Bioinformatics . 2018 ,第9期

机译：Gram-CNN：具有本地背景的深度学习方法，用于生物医学文本中的名为实体识别
3. DLI-IT: a deep learning approach to drug label identification through image and text embedding [J] . Xiangwen Liu, Joe Meehan, Weida Tong, BMC Medical Informatics and Decision Making . 2020 ,第1期

机译：DLI-it：通过图像和文本嵌入药物标签识别的深入学习方法
4. A Text Recognition Augmented Deep Learning Approach for Logo Identification [C] . Moushumi Medhi, Shubham Sinha, Rajiv Ranjan Sahay Indian conference on vision, graphics and image processing . 2017

机译：徽标识别的文本识别增强深度学习方法
5. Learning Deep Image-Text Representations for Referring Visual Recognition [D] . Li, Ruiyu. 2018

机译：学习深度图像文本表示形式以引用视觉识别
6. GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text [O] . Qile Zhu, Xiaolin Li, Ana Conesa, -1

机译：GRAM-CNN：一种具有本地上下文的深度学习方法用于生物医学文本中的命名实体识别
7. Text Detection and Recognition for Images of Medical Laboratory Reports With a Deep Learning Approach [O] . Wenyuan Xue, Qingyong Li, Qiyuan Xue 2020

机译：深度学习方法医学实验室报告图像图像的文本检测与识别

A Text Recognition Augmented Deep Learning Approach for Logo Identification

摘要

著录项

相似文献

相关主题

期刊订阅