...
首页> 外文期刊>Computers, Materials & Continua >Text Detection and Recognition for Natural Scene Images Using Deep Convolutional Neural Networks
【24h】

Text Detection and Recognition for Natural Scene Images Using Deep Convolutional Neural Networks

机译:使用深卷积神经网络的自然场景图像的文本检测与识别

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Words are the most indispensable information in human life. It is very important to analyze and understand the meaning of words. Compared with the general visual elements, the text conveys rich and high-level moral information, which enables the computer to better understand the semantic content of the text. With the rapid development of computer technology, great achievements have been made in text information detection and recognition. However, when dealing with text characters in natural scene images, there are still some limitations in the detection and recognition of natural scene images. Because natural scene image has more interference and complexity than text, these factors make the detection and recognition of natural scene image text face many challenges. To solve this problem, a new text detection and recognition method based on depth convolution neural network is proposed for natural scene image in this paper. In text detection, this method obtains high-level visual features from the bottom pixels by ResNet network, and extracts the context features from character sequences by BLSTM layer, then introduce to the idea of faster R-CNN vertical anchor point to find the bounding box of the detected text, which effectively improves the effect of text object detection. In addition, in text recognition task, DenseNet model is used to construct character recognition based on Kares. Finally, the output of Softmax is used to classify each character. Our method can replace the artificially defined features with automatic learning and context-based features. It improves the efficiency and accuracy of recognition, and realizes text detection and recognition of natural scene images. And on the PAC2018 competition platform, the experimental results have achieved good results.
机译:单词是人类生活中最不可或缺的信息。分析和理解单词的含义非常重要。与一般的视觉元素相比,文本传达了丰富和高级别的道德信息,这使得计算机能够更好地了解文本的语义内容。随着计算机技术的快速发展,文本信息检测和识别方面取得了巨大成就。但是,在处理自然场景图像中的文本字符时,在自然场景图像的检测和识别中仍然存在一些局限性。因为自然场景图像具有比文本更大的干扰和复杂性,所以这些因素使自然场景图像文本的检测和识别面临着许多挑战。为了解决这个问题,提出了一种基于深度卷积神经网络的新文本检测和识别方法,用于本文的自然场景图像。在文本检测中,该方法通过Reset网络从底部像素获得高级视觉功能,并通过BLSTM层从字符序列中提取上下文特征,然后介绍更快的R-CNN垂直锚点的思想,找到边界框检测到的文本,有效提高了文本对象检测的效果。此外,在文本识别任务中,DenSenet模型用于构建基于kares的字符识别。最后,softmax的输出用于对每个字符进行分类。我们的方法可以用自动学习和基于上下文的特征替换人工定义的功能。它提高了识别的效率和准确性,实现了自然场景图像的文本检测和识别。在PAC2018竞争平台上,实验结果取得了良好的效果。

著录项

  • 来源
    《Computers, Materials & Continua》 |2019年第1期|289-300|共12页
  • 作者单位

    School of Computer Science Chengdu University of Information Technology Chengdu 610225 China;

    School of Computer Science Chengdu University of Information Technology Chengdu 610225 China;

    School of Computer Science University of Nottingham Jubilee Campus NG8 1BB UK;

    School of Computer Science Chengdu University of Information Technology Chengdu 610225 China;

    School of Computer Science Chengdu University of Information Technology Chengdu 610225 China School of Information and Software Engineering University of Electronic Science and Technology of China Chengdu 610054 China;

    School of Computer Science Chengdu University of Information Technology Chengdu 610225 China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Detection; recognition; resnet; blstm; faster R-CNN; densenet;

    机译:检测;认出;reset;Blstm;更快的R-CNN;Densenet.;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号