首页> 外国专利> Text image processing using stroke-aware max-min pooling for OCR system employing artificial neural network

Text image processing using stroke-aware max-min pooling for OCR system employing artificial neural network

机译:使用人工神经网络的OCR系统中基于笔划感知最大-最小合并的文本图像处理

摘要

In an optical character recognition (OCR) method for digitizing printed text images using a long-short term memory (LSTM) network, text images are pre-processed using a stroke-aware max-min pooling method before being fed into the network, for both network training and OCR prediction. During training, an average stroke thickness is computed from the training dataset. Stroke-aware max-min pooling is applied to each text line image, where minimum pooling is applied if the stroke thickness of the line is greater than the average stroke thickness, while max pooling is applied if the stroke thickness is less than or equal to the average stroke thickness. The pooled images are used for network training. During prediction, stroke-aware max-min pooling is applied to each input text line image, and the pooled image is fed to the trained LSTM network to perform character recognition.
机译:在使用长期短期记忆(LSTM)网络对打印的文本图像进行数字化的光学字符识别(OCR)方法中,在将文本图像馈入网络之前,使用笔画感知的最大-最小合并方法对文本图像进行预处理。网络训练和OCR预测。在训练期间,从训练数据集计算平均笔划粗细。笔画感知的最大-最小合并应用于每个文本行图像,其中如果该行的笔划厚度大于平均笔划厚度,则应用最小合并,而如果笔划厚度小于或等于最大,则应用最大合并平均行程厚度。合并的图像用于网络训练。在预测期间,将笔画感知的最大-最小合并应用于每个输入文本行图像,并将合并的图像馈送到经过训练的LSTM网络以执行字符识别。

著录项

  • 公开/公告号US10373022B1

    专利类型

  • 公开/公告日2019-08-06

    原文格式PDF

  • 申请/专利权人 KONICA MINOLTA LABORATORY U.S.A. INC.;

    申请/专利号US201815908714

  • 发明设计人 YONGMIAN ZHANG;SHUBHAM AGARWAL;

    申请日2018-02-28

  • 分类号G06K9/62;G06N3/08;G06T7/12;G06K9;G06N3/04;

  • 国家 US

  • 入库时间 2022-08-21 12:14:00

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号