首页> 外文会议>Computer vision, graphics and image processing >A Text Recognition Augmented Deep Learning Approach for Logo Identification
【24h】

A Text Recognition Augmented Deep Learning Approach for Logo Identification

机译:文本识别增强深度学习方法在徽标识别中的应用

获取原文
获取原文并翻译 | 示例

摘要

Logo/brand name detection and recognition in unstructured and highly unpredictable natural images has always been a challenging problem. We notice that in most natural images logos are accompanied with associated text. Therefore, we address the problem of logo recognition by first detecting and isolating text of varying color, font size and orientation in the input image using affine invariant maximally stable extremal regions (MSERs). Using an off-the-shelf OCR, we identify the text associated with the logo image. Then an effective grouping technique is employed to combine the remaining stable regions based on spatial proximity of MSERs. Deep learning has the advantage that optimal features can be learned automatically from image pixel data. This motivates us to feed the clustered logo candidate image regions to a pre-trained deep convolutional neural network (DCNN) to generate a set of complex features which are further input to a multiclass support vector machine (SVM) for classification. We tested our proposed logo recognition system on 32 logo classes, and a non-logo class obtained by combining FlickrLogos-32 and MICC logo databases, amounting to a total of 23582 training and testing images. Our method yields robust recognition performance, outperforming state-of-the-art techniques achieving 97.8% precision, 95.7% recall and 95.7% average accuracy on the combined MICC and FlickrLogos-32 datasets and a precision of 98.6%, recall of 97.9% and average accuracy of 99.6% on only the FlickrLogos-32 dataset.
机译:在非结构化和高度不可预测的自然图像中检测徽标/商标名称一直是一个具有挑战性的问题。我们注意到,在大多数自然图像中,徽标都带有相关的文字。因此,我们通过使用仿射不变最大稳定极值区域(MSER)首先检测和隔离输入图像中颜色,字体大小和方向变化的文本来解决徽标识别的问题。使用现成的OCR,我们可以识别与徽标图像关联的文本。然后,基于MSER的空间接近度,采用有效的分组技术组合其余的稳定区域。深度学习的优势在于可以从图像像素数据中自动学习最佳功能。这促使我们将聚类的徽标候选图像区域输入到预先训练的深度卷积神经网络(DCNN)中,以生成一组复杂的特征,这些特征将进一步输入到多类支持向量机(SVM)中进行分类。我们在32个徽标类上测试了我们提出的徽标识别系统,并通过组合FlickrLogos-32和MICC徽标数据库获得了一个非徽标类,总共有23582个训练和测试图像。我们的方法具有强大的识别性能,在结合了MICC和FlickrLogos-32数据集的情况下,性能优于最新技术,达到97.8%的精度,95.7%的查全率和95.7%的平均准确率,以及98.6%的查准率,97.9%的查全率和仅FlickrLogos-32数据集的平均准确度为99.6%。

著录项

  • 来源
  • 会议地点 Guwahati(IN)
  • 作者单位

    Computational Vision Lab, Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India;

    Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur 711103, West Bengal, India;

    Department of Electrical Engineering, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Logo detection; Logo recognition; DCNN; MSER;

    机译:标志检测;徽标识别; DCNN; SER;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号