首页> 外文会议>International Conference- Confluence The Next Generation Information Technology Summit >Recognition of off-line hand printed English Characters, Numerals and Special Symbols
【24h】

Recognition of off-line hand printed English Characters, Numerals and Special Symbols

机译:识别离线手印刷英文字符,数字和特殊符号

获取原文

摘要

The generic process of Optical Character Recognition (OCR), an area of intensive research in the field of Artificial Intelligence, Pattern Recognition and Computer Vision, aims to recognize text from scanned document images, where data can be in machine printed or hand written format. Optical Character Recognition can improve the interaction between man and machine in various applications including data entry, office automation, digital library, banking applications, health insurance and tax forms etc. Much of work has been done in the recognition of machine printed characters in various languages with considerably good efficiencies, however making robust recognition engines that can be put to recognize hand written and hand printed data with commendable recognition rates still remains as an active area of research owing to the challenges like diverse human handwriting style, variation in shape, angle and style of characters. Taking into account the challenges and scope for improvement in this domain, the work of off-line character recognition of hand printed document images containing English Characters-Uppercase and Lowercase, Numerals and Special Characters has been presented. Statistical, Geometric and Directional Feature Extraction techniques have been applied over segmented character image. Classification was done using Multilayer perception neural network (NN) with back propagation and Support vector machine (SVM) classifier. The recognition rates achieved were up to 98% for Numerals, 96.5% for Special Characters, 95.35% for Uppercase Characters, 92% for Lowercase Characters. The system for combined data set-Characters, Numerals and Special Symbols resulted out to be 92.167% accurate, using SVM as classifier.
机译:光学字符识别(OCR)的通用过程,人工智能领域的密集研究领域,旨在识别来自扫描文档图像的文本,其中数据可以是机器印刷或手写写入格式。光学字符识别可以提高各种应用中的人员和机器之间的交互,包括数据输入,办公自动化,数字图书馆,银行应用,健康保险和税收形式等。在各种语言中的机器印刷人物的识别方面已经完成了大部分工作由于诸多识别手写和手动印刷数据的强大效率,由于不同人类手写风格,形状,角度变化的挑战,仍然是具有值得称道的识别率的手写和手动印刷数据仍然是一个活跃的研究领域。风格的人物。考虑到该领域改进的挑战和范围,已经介绍了包含英文字符和小写,数字和特殊字符的手印刷文档图像的离线字符识别的工作。统计,几何和方向特征提取技术已在分段字符图像上施加。使用多层感知神经网络(NN)进行分类,背部传播和支持向量机(SVM)分类器。数值的识别率高达98%,特殊字符为96.5%,大写字符为95.35%,小写字符为92%。使用SVM作为分类器,组合数据集合,数字和特殊符号的组合系统为92.167%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号