首页> 外文会议>12th International Conference on Frontiers in Handwriting Recognition >The Zone-Based Projection Distance Feature Extraction Method for Handwritten Numeral/Mixed Numerals Recognition of Indian Scripts
【24h】

The Zone-Based Projection Distance Feature Extraction Method for Handwritten Numeral/Mixed Numerals Recognition of Indian Scripts

机译:印度文字手写数字/混合数字识别的基于区域的投影距离特征提取方法

获取原文

摘要

Handwriting recognition has always been a challenging task in image processing and pattern recognition. India is a multi-lingual, multi-script country, where eighteen official scripts are accepted and there are over a hundred regional languages. The feature extraction method is probably the most effective method in achieving high recognition performance. In this study we proposed a zone-based feature extraction algorithm scheme for the recognition of off-line handwritten numerals of south-Indian scripts. The character centroid is computed and the characterumeral image (50×50) is further divided in to 25 equal zones (10×10). The average distance from the character centroid to the pixels present in the zone column was computed. This procedure was sequentially repeated for all the zone/grid/box columns present in the zone (10 features). This procedure was sequentially repeated for the entire zone present in the numeral image (250 features). Similarly, again the character centroid was computed and the image is further divided into 50 equal zones (5×10). The average distance from the image centroid to the pixels present in the zone was computed. This procedure was sequentially repeated for the entire zone present in the numeral image (50 features). There could be some zone/zone column that is empty of foreground pixels, then the feature value of that zone column/zone in the feature vector is zero. Finally, 300 such features were extracted for classification and recognition. The nearest neighbor, feed forward back propagation neural network and support vector machine classifiers were used for subsequent classification and recognition purposes. We obtained a recognition rate of 98.05, for Kannada numerals, 95.1 for Tamil numerals, 97.2 for Telugu numerals and 95.7 for Malayalam numerals using support vector machine.
机译:手写识别一直是图像处理和模式识别中的一项艰巨任务。印度是一个多语言,多文字的国家,该地区接受十八种官方文字,并且有一百多种区域语言。特征提取方法可能是实现高识别性能的最有效方法。在这项研究中,我们提出了一种基于区域的特征提取算法方案,用于识别南印度文字的离线手写数字。计算字符质心,并将字符/数字图像(50×50)进一步划分为25个相等的区域(10×10)。计算了字符质心到区域列中存在的像素的平均距离。对区域中存在的所有区域/网格/箱列依次重复执行此过程(10个功能)。对于数字图像(250个特征)中存在的整个区域,依次重复此过程。同样,再次计算字符质心,并将图像进一步划分为50个相等的区域(5×10)。计算了从图像质心到该区域中存在的像素的平均距离。对于数字图像中存在的整个区域(50个特征),依次重复此过程。可能有一些区域/区域列中没有前景像素,因此该区域列/区域的特征值在特征向量中为零。最后,提取了300个此类特征进行分类和识别。最近的邻居,前馈回传神经网络和支持向量机分类器用于后续分类和识别。使用支持向量机,我们对卡纳达语数字的识别率为98.05,对泰米尔语数字的识别率为95.1,对于泰卢固语数字的识别率为97.2,对于马拉雅拉姆语数字的识别率为95.7。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号