Multi-font printed Chinese character recognition using multi-pooling convolutional neural network

机译：使用多池卷积神经网络的多字体印刷汉字识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Although previous studies have achieved effective printed Chinese character recognition (PCCR) in the case a single font or a few different fonts, large scale multi-font PCCR remains a major challenge owing to the wide variety in the shape, layout, and grey-level distribution of single Chinese characters across different font styles. This paper applies multi-pooling and data augmentation with non-linear transformation to a convolutional neural network (CNN) for multi-font PCCR. We propose a multi-pooling layer on top of the final convolutional layer; this approach is found to be robust to spatial layout variations and deformations in multi-font printed Chinese characters. Experimental results show that multi-pooling significantly improves CNN performance. In addition, we adopt a distorted sample generation technique by applying non-linear warping functions along an original font image, which distorts the local density of image-based Chinese character strokes. We find that CNN performance is further boosted by the distorted samples technique. An input character image is transformed into four distorted images and the CNN learns the original image as well as the distorted samples to classify 3755 classes (level-1 set of GB2312-80) of printed Chinese characters in 280 widely varying fonts and 120 manually selected fonts. Outstanding recognition rates of 94.38% and 99.74% are achieved in the former and latter cases, respectively, which indicates the effectiveness of the proposed methods.

机译：虽然以前的研究在单一字体或几个不同的字体的情况下取得了有效的汉字识别（PCCR），但大规模的多字体PCCR仍然是由于形状，布局和灰度级的各种各样的主要挑战不同字体样式的单一汉字分布。本文应用多池和数据增强与非线性变换到多字体PCCR的卷积神经网络（CNN）。我们在最终卷积层顶部提出了一种多池层;发现这种方法对空间布局变体和多字体印刷汉字的变形具有强大。实验结果表明，多池显着提高了CNN性能。此外，我们通过沿着原始字体图像应用非线性翘曲功能来采用扭曲的样本生成技术，这扭曲了基于图像的汉字笔画的局部密度。我们发现通过扭曲的样本技术进一步提高了CNN性能。将输入字符图像转换为四个失真的图像，CNN学习原始图像以及扭曲的样本，以在280个广泛变化的字体中对打印的汉字进行分类为3755类（Level-1组GB2312-80），以为手动选择120个字体。在前者和后一种情况下，出色的识别率为94.38％和99.74％，这表明了所提出的方法的有效性。

著录项

来源
《International Conference on Document Analysis and Recognition》|2015年||共5页
会议地点
作者
Zhuoyao Zhong; Lianwen Jin; Ziyong Feng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
character recognition; convolution; distortion; neural nets; CNN; PCCR; convolutional neural network; data augmentation; distorted sample generation technique; final convolutional layer; font styles; image-based Chinese character stroke local density; multifont printed Chinese character recognition; multipooling convolutional neural network; nonlinear transformation; nonlinear warping function; printed Chinese character recognition; single Chinese character grey-level distribution; single Chinese character layout; single Chinese character shape; spatial layout variations; Artificial neural networks; Handwriting recognition; Image recognition; Robustness; Data augmentation; Multi-font; Printed Chinese character recognition; convolutional neural network;

机译：字符识别;卷积;扭曲;CNN;PCCR;卷积神经网络;数据增强;扭曲的样本生成技术;最终卷积层;字体样式;基于图像的汉字冲程局部密度;多功能印刷的汉字识别;多功能卷积的神经网络;非线性变换;非线性翘曲功能;印刷汉字识别;单一汉字灰级分布;单一汉字布局;单一汉字形状;空间布局变化;人工神经网络;手写识别;图像识别;鲁棒性;数据增强;多字体;印刷汉字识别;卷积神经网络;

相似文献

外文文献
中文文献
专利

1. Applying hashing search and fuzzy fault-tolerant algorithms for the fast recognition of multi-font printed chinese characters [J] . Ji-Rong Lin, Chang-Fuu Chen Journal of the Chinese Institute of Engineers . 1998,第4期

机译：应用哈希搜索和模糊容错算法快速识别多字体印刷汉字
2. A neural network-based approach for recognizing multi-font printed English characters [J] . Najmeh Samadiani, Hamid Hassanpour Journal of Electrical Systems and Information Technology . 2015,第2期

机译：基于神经网络的多字体印刷英文字符识别方法
3. The Handwritten Chinese Character Recognition Uses Convolutional Neural Networks with the GoogLeNet [J] . Bi Ning, Chen Jiahao, Tan Jun International Journal of Pattern Recognition and Artificial Intelligence . 2019,第11期

机译：手写汉字识别通过GoogLeNet使用卷积神经网络
4. Multi-font printed Chinese character recognition using multi-pooling convolutional neural network [C] . Zhuoyao Zhong, Lianwen Jin, Ziyong Feng International Conference on Document Analysis and Recognition . 2015

机译：多池卷积神经网络的多字体印刷汉字识别
5. On-line character recognition of handprinted Chinese characters using fuzzy measuring and structural analysis. [D] . Yeh, Song-Shen. 1995

机译：基于模糊测量和结构分析的手印汉字在线字符识别。
6. Handwritten Bangla Character Recognition Using the State-of-the-Art Deep Convolutional Neural Networks [O] . Md Zahangir Alom, Paheding Sidike, Mahmudul Hasan, 2018

机译：使用最先进的深度卷积神经网络进行手写Bangla字符识别
7. A neural network-based approach for recognizing multi-font printed English characters [O] . Samadiani Najmeh, Hassanpour Hamid 2015

机译：基于神经网络的多字体印刷英文字符识别方法

Multi-font printed Chinese character recognition using multi-pooling convolutional neural network

摘要

著录项

相似文献

相关主题

期刊订阅