首页> 外文期刊>Advances in Computer Science and Information Technology: ACSIT >An Efficient Algorithm for Characters recognition of Printed Oriya Script
【24h】

An Efficient Algorithm for Characters recognition of Printed Oriya Script

机译:一种有效的打印oriya脚本的字符识别算法

获取原文
           

摘要

—The subject of character recognition has received considerable attention in recent years. Character recognition is a process of converting handwritten or printed text images into machine readable code or text. Optical character recognition is used for many applications such as 1) Handwriting recognition systems, 2) Number plate recognition systems, 3) text recognition systems, 4) Data entry for business documents. In this, we are concerned with the recognition of printed Oriya script a popular Indian script. The development of OCR for this script is challenging as number of identified classes are more than 380. In the proposed approach, the digitized document image is first passed through preprocessing modules. The preprocessed data is segmented to the symbol level using horizontal and_ vertical projection profiling. The proposed approach use HOG feature extraction technique and SVM (Support Vector Machine) is used as a classifier. This approach is able to distinguishing between characters that have very similar shapes. A prototype of the system has been tested on a variety of printed Oriya material, and currently achieves 97.2% character level accuracy on average.
机译:- 近年来,性格识别的主题得到了相当大的关注。字符识别是将手写或打印文本图像转换为机器可读代码或文本的过程。光学字符识别用于许多应用程序,例如1)手写识别系统,2)号码板识别系统,3)文本识别系统,4)用于业务文档的数据条目。在这方面,我们担心印刷的oriya脚本是一个受欢迎的印度剧本。对于此脚本的OCR的开发是具有挑战性的,因为所识别的类数量超过380.在所提出的方法中,首先通过预处理模块传递数字化文档图像。使用水平和垂直投影分析将预处理数据分段为符号级别。所提出的方法使用HOG特征提取技术和SVM(支持向量机)用作分类器。这种方法能够区分具有非常相似的形状的字符。该系统的原型已经在各种印刷的oriya材料上进行了测试,并且目前平均实现了97.2%的性格水平精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号