首页> 外文期刊>Multimedia, IEEE Transactions on >A Convolutional Neural Network-Based Chinese Text Detection Algorithm via Text Structure Modeling
【24h】

A Convolutional Neural Network-Based Chinese Text Detection Algorithm via Text Structure Modeling

机译:基于卷积神经网络的文本结构建模的中文文本检测算法

获取原文
获取原文并翻译 | 示例
           

摘要

Text detection in a natural environment plays an important role in many computer vision applications. While existing text detection methods are focused on English characters, there are strong application demands on text detection in other languages, such as Chinese. In this paper, we present a novel text detection algorithm for Chinese characters based on a specific designed convolutional neural network (CNN). The CNN contains a text structure component detector layer, a spatial pyramid layer, and a multi-input-layer deep belief network (DBN). The CNN is pre-trained via a convolutional sparse auto-encoder, specifically designed for extracting complex features from Chinese characters. In particular, the text structure component detectors enhance the accuracy and uniqueness of feature descriptors by extracting multiple text structure components in various ways. The spatial pyramid layer enhances the scale invariability of the CNN for detecting texts in multiple scales. Finally, the multi-input-layer DBN replaces the fully connected layers in the CNN to ensure features from multiple scales are comparable. A multilingual text detection dataset, in which texts in Chinese, English, and digits are labeled separately, is set up to evaluate the proposed text detection algorithm. The proposed algorithm shows a significant performance improvement over the baseline CNN algorithms. In addition the proposed algorithm is evaluated over a public multilingual benchmark and achieves state-of-the-art result under multiple languages. Furthermore, a simplified version of the proposed algorithm with only general components is evaluated on the ICDAR 2011 and 2013 datasets, showing comparable detection performance to the existing general text detection algorithms.
机译:自然环境中的文本检测在许多计算机视觉应用中都扮演着重要角色。虽然现有的文本检测方法集中于英文字符,但是对其他语言(例如中文)的文本检测有强烈的应用需求。在本文中,我们提出了一种基于特定设计的卷积神经网络(CNN)的新颖的汉字文本检测算法。 CNN包含文本结构组件检测器层,空间金字塔层和多输入层深度信任网络(DBN)。通过卷积稀疏自动编码器对CNN进行预训练,该卷积稀疏自动编码器专门设计用于从汉字中提取复杂特征。特别地,文本结构成分检测器通过以各种方式提取多个文本结构成分来提高特征描述符的准确性和唯一性。空间金字塔层增强了CNN的尺度不变性,可用于检测多个尺度的文本。最后,多输入层DBN取代了CNN中的完全连接层,以确保来自多个比例的要素具有可比性。建立了多语言文本检测数据集,其中分别标记了中文,英文和数字的文本,以评估提出的文本检测算法。所提出的算法相对于基础CNN算法显示出显着的性能改进。另外,所提出的算法在公共多语言基准上进行了评估,并在多种语言下获得了最新的结果。此外,在ICDAR 2011和2013数据集上评估了仅具有通用组件的拟议算法的简化版本,显示了与现有通用文本检测算法相当的检测性能。

著录项

  • 来源
    《Multimedia, IEEE Transactions on》 |2017年第3期|506-518|共13页
  • 作者单位

    Department of Electronic Engineering, Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, Shanghai, China;

    Department of Electronic Engineering, Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, Shanghai, China;

    School of Engineering and Applied Science, Aston University, Birmingham, U.K.;

    Department of Electronic Engineering, Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, Shanghai, China;

    Department of Electronic Engineering, Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, Shanghai, China;

    Department of Electronic Engineering, Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, Shanghai, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Feature extraction; Detection algorithms; Detectors; Neural networks; Unsupervised learning; Machine learning; Image edge detection;

    机译:特征提取;检测算法;检测器;神经网络;无监督学习;机器学习;图像边缘检测;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号