首页> 外文会议>International Conference on Image and Graphics >A Weakly Supervised Text Detection Based on Attention Mechanism
【24h】

A Weakly Supervised Text Detection Based on Attention Mechanism

机译:基于注意机制的弱监督文本检测

获取原文

摘要

In this paper, we propose a new method for natural image text detection under a weakly supervised data set. Currently, most of the text detection models are based on bounding box label training data. However, the cost of the bounding box label training data is very high. In order to solve this problem, we propose an attention mechanism that can be trained on image-level labels data and roughly identifies text regions via an automatically learned attentional map based on a convolutional neural network. There are three main steps: firstly, a VGG model is trained using image-level labels data to score the likelihood that a text region exists in the picture; secondly, the region of interest is extracted by means of the attention mechanism and the extracted region is evaluated using the network trained in the first step to getting the text region and finally, the text line is extracted in the text region using the MSER algorithm. Trained with the weakly supervised data which is only with image-level labels, our model can generate bounding boxes for the text line in the image. The results of our model are very close to those of the models using bounding box label training data on the text detection benchmark sets of MSRA-TD500, ICDAR2013, and ICDAR2015.
机译:在本文中,我们提出了一种在弱监督数据集下自然图像文本检测方法。目前,大多数文本检测模型都基于边界框标签训练数据。但是,边界框标签训练数据的成本非常高。为了解决这个问题,我们提出了一种注意机制,可以在图像级标签数据上训练,并通过基于卷积神经网络自动学习的注意力地图大致识别文本区域。有三个主要步骤:首先,使用图像级标签数据训练VGG模型,以得分图片中存在文本区域的可能性;其次,通过注意机制提取感兴趣区域,并且使用在第一步中训练的网络评估提取的区域来获取文本区域,最后,使用MSER算法在文本区域中提取文本线。使用弱监管数据仅具有图像级标签,我们的模型可以为图像中的文本行生成边界框。我们的模型的结果非常接近使用MSRA-TD500,ICDAR2013和ICDAR2015的文本检测基准组上的边界框标签培训数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号