首页> 外文会议>IEEE Winter Applications of Computer Vision Workshops >Memory-Efficient Models for Scene Text Recognition via Neural Architecture Search
【24h】

Memory-Efficient Models for Scene Text Recognition via Neural Architecture Search

机译:通过神经体系结构搜索实现场景文本识别的内存有效模型

获取原文

摘要

Meta-learning techniques based on neural architecture search (NAS) show excellent performance in the design of learning models used in deep neural networks. In particular, when NAS is applied to design a convolutional neural network (CNN) for image recognition, the performance of the network when evaluating public benchmark datasets such as CIFAR10 and ImageNet exceeds that of hand-designed models. Nevertheless, there are very few cases wherein NAS has been applied to real-world problems, i.e. recognition problems with a limited dataset. We proposed a method in which the NAS technique does not require a proxy task for the scene text recognition (STR) framework to apply the NAS method to a new image recognition field. Therefore, we proposed an architecture space for CNN-based modules in the STR framework and applied the ProxylessNAS method, enabling end-to-end training while meta learners design a new model that requires only a single commonly used GPU (approximately 100 GPU hours). To evaluate the STR model obtained by the proposed NAS method, seven STR benchmark datasets were used. Finally, the obtained model could achieve a performance similar to that of the ideal model in terms of accuracy and number of parameters. We thus confirm that the model design based on NAS can be effectively applied to STR scenarios.
机译:基于神经体系结构搜索(NAS)的元学习技术在深度神经网络中使用的学习模型的设计中显示出出色的性能。特别是,当将NAS用于设计用于图像识别的卷积神经网络(CNN)时,评估公共基准数据集(如CIFAR10和ImageNet)时网络的性能超过了手工设计的模型。然而,在极少数情况下,NAS已被应用于实际问题,即数据集有限的识别问题。我们提出了一种方法,其中NAS技术不需要场景文本识别(STR)框架的代理任务即可将NAS方法应用于新的图像识别领域。因此,我们在STR框架中为基于CNN的模块提出了一个架构空间,并应用了ProxylessNAS方法,从而实现了端到端培训,而元学习者设计了一个仅需一个常用GPU(大约100个GPU小时)的新模型。 。为了评估通过建议的NAS方法获得的STR模型,使用了七个STR基准数据集。最后,在精度和参数数量方面,所获得的模型可以实现与理想模型相似的性能。因此,我们确认基于NAS的模型设计可以有效地应用于STR场景。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号