首页>
外国专利>
Text image processing using stroke-aware max-min pooling for OCR system employing artificial neural network
Text image processing using stroke-aware max-min pooling for OCR system employing artificial neural network
展开▼
机译:使用人工神经网络的OCR系统中基于笔划感知最大-最小合并的文本图像处理
展开▼
页面导航
摘要
著录项
相似文献
摘要
In an optical character recognition (OCR) method for digitizing printed text images using a long-short term memory (LSTM) network, text images are pre-processed using a stroke-aware max-min pooling method before being fed into the network, for both network training and OCR prediction. During training, an average stroke thickness is computed from the training dataset. Stroke-aware max-min pooling is applied to each text line image, where minimum pooling is applied if the stroke thickness of the line is greater than the average stroke thickness, while max pooling is applied if the stroke thickness is less than or equal to the average stroke thickness. The pooled images are used for network training. During prediction, stroke-aware max-min pooling is applied to each input text line image, and the pooled image is fed to the trained LSTM network to perform character recognition.
展开▼