【24h】

Classification of offline gujarati handwritten characters

机译:离线古吉拉特语手写字符的分类

获取原文

摘要

Intelligent Character Recognition (ICR) is a specific form of optical character recognition (OCR) dealing mostly with handwritten texts. Due to their specificity, they are usually more adept in interpreting different styles and fonts of handwriting providing eventually higher recognition rates. Factors like language constructs, amount of research on ICR pertaining to the language, etc., essentially determines the amount of success achieved in its character recognition. This research mainly deals with the recognition of Gujarati Handwritten Characters. We have considered 34 consonants and 5 vowels; a total of 39 Gujarati Characters. The structure and lexicons of the language posed a challenge during the initial phase of segmentation; for that we have proposed new algorithm for segmentation. Our segmentation algorithm is able to address these concerns effectively. Different algorithms from different domains have been considered for comparative analysis like Transform Domain (DWT, DCT and DFT), from Spatial Domain; Geometric Method (Gradient feature), Structural method (Freeman chain code) and Statistical method (Zernike Moments). We have also proposed a new Combination of Structural and Statistical methods (Freeman chain code, Hu's invariant moment and center of mass) to extract feature vectors and it results into good amount of accuracy. These extracted feature vectors were further supplied as input into Support Vector Machines and their resulting accuracies were analyzed using 10 fold cross validation. SVM performs well on data sets that have many attributes and can also handle large number of classes.
机译:智能字符识别(ICR)是光学字符识别(OCR)的一种特殊形式,主要处理手写文本。由于它们的特殊性,它们通常更擅长于解释不同的笔迹样式和字体,从而最终提供更高的识别率。诸如语言结构,与该语言有关的ICR研究量等因素,从本质上决定了其字符识别所获得的成功量。这项研究主要涉及古吉拉特语手写字符的识别。我们考虑了34个辅音和5个元音;共有39个古吉拉特语角色。在分割的初始阶段,该语言的结构和词典构成了挑战。为此,我们提出了一种新的分割算法。我们的细分算法能够有效解决这些问题。已考虑将来自不同域的不同算法进行比较分析,例如来自空间域的变换域(DWT,DCT和DFT);几何方法(梯度特征),结构方法(弗里曼链码)和统计方法(泽尼克矩)。我们还提出了一种新的结构和统计方法的组合(弗里曼链码,胡氏不变矩和质心)来提取特征向量,从而获得了很好的准确性。这些提取的特征向量被进一步提供作为支持向量机的输入,并使用10倍交叉验证对它们产生的准确性进行了分析。 SVM在具有许多属性的数据集上表现良好,并且还可以处理大量的类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号