...
首页> 外文期刊>Journal of Theoretical and Applied Information Technology >HANDWRITTEN BENGALI CHARACTER RECOGNITION THROUGH GEOMETRY BASED FEATURE EXTRACTION
【24h】

HANDWRITTEN BENGALI CHARACTER RECOGNITION THROUGH GEOMETRY BASED FEATURE EXTRACTION

机译:基于几何特征提取的手写孟加拉语字符识别

获取原文
           

摘要

Unlike English characters, one of the major drawbacks in recognizing handwritten Bengali script is the massive amount of characters in Bengali language and their complex shapes. There are 50 complex shaped characters in Bengali alphabet set and working with this huge amount of characters with an appropriate set of feature is a tough problem to solve. Moreover, the ambiguity and precision error are common in handwritten words. In addition, among the huge amount of complex shaped characters, some are very similar in shape those possess severe difficulty to recognize handwritten Bengali characters. Bearing in mind the complexity of the problem, an efficient approach for recognizing handwritten Bengali alphabet is proposed in this work. This proposed approach for identifying Bengali characters is based on character geometry-oriented feature extraction for different handwritten characters. In this paper, different image processing steps are used including image acquisition, digitization , preprocessing, segmentation and feature extraction for tackling the difficulty. Most importantly, the geometry based feature extraction method has been employed to extract the effective features from the Bengali characters for the classification purposes. Then, the classification result was measured for SVM and Artificial Neural Network (ANN) based classifiers on self-generated training and testing data sets which contain 2500 different samples of 50 characters in the Bengali character-set. The proposed technique produces an average recognition rate of 84.56% using SVM and 74.47% using ANN.
机译:与英文字符不同,识别手写孟加拉语脚本的主要缺点之一是孟加拉语中的大量字符及其形状复杂。孟加拉语字母集中有50个复杂形状的字符,要处理大量具有适当功能的字符是一个难题。此外,歧义性和精度误差在手写单词中很常见。此外,在数量众多的复杂形状字符中,有些形状非常相似,难以识别孟加拉的手写字符。考虑到问题的复杂性,在这项工作中提出了一种识别手写孟加拉语字母的有效方法。所提出的孟加拉语字符识别方法基于针对不同手写字符的面向字符几何特征提取。为了解决这一难题,本文采用了不同的图像处理步骤,包括图像采集,数字化,预处理,分割和特征提取。最重要的是,基于几何的特征提取方法已被用于从孟加拉字符中提取有效特征,以进行分类。然后,对基于支持向量机和人工神经网络(ANN)的分类器的分类结果进行测量,这些分类器是基于自生成的训练和测试数据集,该数据集包含2500个孟加拉字符集中的50个字符的样本。所提出的技术使用SVM产生的平均识别率为84.56%,使用ANN产生的平均识别率为74.47%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号