首页> 外文期刊>Computer Science & Information Technology >A Novel Feature Set for Recognition of Similar Shaped Handwritten Hindi Characters Using Machine Learning
【24h】

A Novel Feature Set for Recognition of Similar Shaped Handwritten Hindi Characters Using Machine Learning

机译:一种新型特征集,用于使用机器学习识别相似形状的手写印地语字符

获取原文
           

摘要

The growing need of handwritten Hindi character recognition in Indian offices such as passport, railway etc, has made it a vital area of research. Similar shaped characters are more prone to misclassification. In this paper four Machine Learning (ML) algorithms namely Bayesian Network, Radial Basis Function Network (RBFN), Multilayer Perceptron (MLP), and C4.5 Decision Tree are used for recognition of Similar Shaped Handwritten Hindi Characters (SSHHC) and their performance is compared. A novel feature set of 85 features is generated on the basis of character geometry. Due to the high dimensionality of feature vector, the classifiers can be computationally complex. So, its dimensionality is reduced to 11 and 4 using Correlation-Based (CFS) and Consistency-Based (CON) feature selection techniques respectively. Experimental results show that Bayesian Network is a better choice when used with CFS while C4.5 gives better performance with CON features.
机译:印度办公室(例如护照,铁路等)对手写印地语字符识别的需求不断增长,这使其成为一个重要的研究领域。类似形状的字符更容易分类错误。本文使用贝叶斯网络,径向基函数网络(RBFN),多层感知器(MLP)和C4.5决策树这四种机器学习(ML)算法来识别相似形状的手写印地语字符(SSHHC)及其性能比较。基于角色几何图形生成了一个包含85个特征的新颖特征集。由于特征向量的维数高,分类器的计算可能会很复杂。因此,分别使用基于相关性(CFS)和基于一致性(CON)的特征选择技术将其维数减少到11和4。实验结果表明,当与CFS一起使用时,贝叶斯网络是一个更好的选择,而C4.5的CON功能可以提供更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号