首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >High accuracy optical character recognition using neural networks with centroid dithering
【24h】

High accuracy optical character recognition using neural networks with centroid dithering

机译:使用带质心抖动的神经网络进行高精度光学字符识别

获取原文
获取原文并翻译 | 示例

摘要

Optical character recognition (OCR) refers to a process whereby printed documents are transformed into ASCII files for the purpose of compact storage, editing, fast retrieval, and other file manipulations through the use of a computer. The recognition stage of an OCR process is made difficult by added noise, image distortion, and the various character typefaces, sizes, and fonts that a document may have. In this study a neural network approach is introduced to perform high accuracy recognition on multi-size and multi-font characters; a novel centroid-dithering training process with a low noise-sensitivity normalization procedure is used to achieve high accuracy results. The study consists of two parts. The first part focuses on single size and single font characters, and a two-layered neural network is trained to recognize the full set of 94 ASCII character images in 12-pt Courier font. The second part trades accuracy for additional font and size capability, and a larger two-layered neural network is trained to recognize the full set of 94 ASCII character images for all point sizes from 8 to 32 and for 12 commonly used fonts. The performance of these two networks is evaluated based on a database of more than one million character images from the testing data set.
机译:光学字符识别(OCR)是指通过使用计算机将打印文档转换为ASCII文件,以进行紧凑存储,编辑,快速检索和其他文件操作的过程。由于增加了噪声,图像失真以及文档可能具有的各种字符字体,大小和字体,使得OCR处理的识别阶段变得困难。在这项研究中,引入了一种神经网络方法来对多尺寸和多字体字符执行高精度识别。一种具有低噪声敏感度归一化程序的新颖质心抖动训练过程可用于获得高精度结果。该研究包括两个部分。第一部分着重于单个大小和单个字体字符,并且训练了一个两层神经网络以识别12点Courier字体的全套94个ASCII字符图像。第二部分将精度与其他字体和大小功能进行了权衡,并且训练了一个较大的两层神经网络,以识别针对从8到32的所有点大小以及12种常用字体的全套94个ASCII字符图像。基于来自测试数据集的超过一百万个字符图像的数据库对这两个网络的性能进行了评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号