Evaluation of Neural Based Feature Extraction Methods for Printed Telugu OCR System

M Swamy Das; Ram Mohan Rao Kovvur

Abstract

Telugu is one of the oldest and most popular languages of India, especially in South India. Little work has been reported on the development of optical character recognition (OCR) systems for Telugu script. Moreover, Telugu is a complex script in which characters are made up of one or more connected components, resulting in a huge number of possible combinations, running into hundreds of thousands. In any OCR system, feature extraction is one of the most important phases. Several methods exist, each suited to different language scripts. These methods are broadly classified as template-based, structural, statistical, neural-network-based and SVM-based. In this paper we describe various feature extraction methods and evaluate them by applying them to Telugu script. In this process we identified diagonal-based, geometrical and distance-metric-based feature extraction methods, and we also propose a pixel-based (pixel map) feature extraction method. All these methods are implemented and evaluated on 364 Telugu characters using a multilayer neural network as the classifier. The recognition accuracies of the geometrical, diagonal, pixel map and distance-metric-based feature extraction methods are 98.6%, 100%, 98.31% and 99.32% respectively. The experiment indicates that the diagonal-based method is more suitable for Telugu script than the other feature extraction methods.
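The abstract does not give the details of the diagonal-based method, so the following is only a minimal sketch of one common formulation from the OCR literature, not the authors' implementation. It assumes a binarized character image (1 for ink, 0 for background) split into square zones; within each zone the foreground pixels are summed along every diagonal and the sums are averaged into a single feature. The function name `diagonal_features` and the zone size are hypothetical choices made for illustration.

```python
def diagonal_features(image, zone=2):
    """Return one feature per zone of a binary image (list of rows of 0/1).

    Assumption: image height and width are multiples of `zone`.
    Each zone of size zone x zone has 2*zone - 1 diagonals; the feature
    is the mean of the foreground-pixel sums along those diagonals.
    """
    rows, cols = len(image), len(image[0])
    if rows % zone or cols % zone:
        raise ValueError("image dimensions must be multiples of the zone size")

    features = []
    for r0 in range(0, rows, zone):          # top row of the current zone
        for c0 in range(0, cols, zone):      # left column of the current zone
            sums = [0] * (2 * zone - 1)      # one accumulator per diagonal
            for r in range(zone):
                for c in range(zone):
                    # Pixels with equal r + c lie on the same diagonal.
                    sums[r + c] += image[r0 + r][c0 + c]
            features.append(sum(sums) / len(sums))
    return features


# Toy 4x4 "character" split into four 2x2 zones (illustrative data only).
sample = [
    [1, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
print(diagonal_features(sample, zone=2))
```

The resulting feature vector (one value per zone) would then be fed to a classifier such as the multilayer neural network mentioned in the abstract.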

Bibliographic details

Source: Advances in Computer Science and Information Technology: ACSIT, 2015, No. 11, 6 pages
Authors: M Swamy Das; Ram Mohan Rao Kovvur
Original format: PDF
CLC classification: Computing technology, computer technology

Similar literature

1. Feature evaluation and extraction based on neural network in analog circuit fault diagnosis [J]. Yuan Haiying, Chen Guangju, Xie Yongle. Journal of Systems Engineering and Electronics (English edition). 2007, No. 2
2. An Efficient Method for Iris Feature Extraction Based on Pulse Coupled Neural Network [J]. Zhang Zaifeng, Xu Guangzhu, Ma Yide. Chinese Journal of Electronics (English edition). 2008, No. 2
3. Risk based security assessment of power system using generalized regression neural network with feature extraction [J]. M. Marsadek, A. Mohamed. Journal of Central South University (English edition). 2013, No. 2
4. A fault diagnosis method of reciprocating compressor based on sensitive feature evaluation and artificial neural network [J]. Xing Chenghong, Xu Fengtian, Yao Ziyun. High Technology Letters (English edition). 2015, No. 4
5. A Structural Analysis Based Feature Extraction Method for OCR System For Myanmar Printed Document Images [J]. Htwe Pa Pa Win, Phyo Thu Thu Khine, KhinNweNi Tun. International Journal of Computer Vision and Image Processing. 2012, No. 1
6. An OCR Free Method for Word Spotting in Printed Documents: the Evaluation of Different Feature Sets [J]. Israel Rios, Alceu de Souza Britto Jr, Alessandro Lameiras Koerich. Journal of Universal Computer Science. 2011, No. 1
7. Evaluation of Different Feature Sets in an OCR Free Method for Word Spotting in Printed Documents [C]. Annual ACM Symposium on Applied Computing. 2010
8. Forest terrain feature characterization using multi-sensor neural image fusion and feature extraction methods [D]. Pugh, Mark L. 2005
9. A Study of Various Feature Extraction Methods on a Motor Imagery Based Brain Computer Interface System [O]. Seyed Navid Resalat, Valiallah Saba. 2016
10. Evaluation of Different Feature Sets in an OCR Free Method for Word Spotting in Printed Documents [O]. Israel Rios, Alceu S. Britto, Alessandro L. Koerich. 2011
11. Target Identification Using Wavelet-based Feature Extraction and Neural Network Classifiers [R]. Lopez, J. E., Chen, H. H., Saulnier, J. 1999
