首页> 外文会议>International Conference on Information Technology Research >Sinhala Handwritten Character Recognition using Convolutional Neural Network
【24h】

Sinhala Handwritten Character Recognition using Convolutional Neural Network

机译:使用卷积神经网络的Sinhala手写字符识别

获取原文

摘要

Handwritten character recognition is widely used for the English language. It is difficult to create a character recognition model for south Asian languages because of its shape and compound characters. Among other South Asian languages (e.g.: - Tamil, Hindi, Malayalam, etc.) Sinhala characters are unique, because of their shape, which are having mostly curves and dots. These unique characteristics make it difficult to create a model to recognize Sinhala’s handwritten characters. Recognizing handwritten characters rather than typed characters is more complicated because the handwriting of each individual is varying from each other. Therefore the recognition of Sinhala handwritten character need to be improved. Convolutional Neural Network (CNN) is playing a vital role in character recognition by supporting the more efficient image classification. This research focuses on recognizing Sinhala handwritten characters using CNN. Google colaboratory platform is used for the experiment, and python programming language is used for the implementation part. In total, around 110,000 image data were used for the experiment. CNN’s performance was evaluated by training and testing the dataset by increasing the number of character classes. When it reaches 100 character class it shows reasonable accuracy of 90.27%. The model was trained by 5 sets of different 100 character classes. Finally, the overall accuracy of 82.33% is achieved for 434 characters. This model outerformed than similar systems.
机译:手写字符识别广泛用于英语。由于其形状和复合字符,难以为南亚语言创建一个字符识别模型。在其他南亚语言(例如: - 泰米尔,印地语,Malayalam等)中,Sinhala角色是独一无二的,因为它们具有主要曲线和点。这些独特的特征使得难以创建模型来识别Sinhala的手写字符。识别手写字符而不是键入字符更复杂,因为每个人的手写彼此不同。因此,需要改善对Sinhala手写性格的识别。卷积神经网络(CNN)通过支持更有效的图像分类,在字符识别中扮演至关重要的作用。这项研究侧重于使用CNN识别Sinhala手写字符。谷歌Colaboratory平台用于实验,并且Python编程语言用于实施部分。总共有大约110,000个图像数据用于实验。通过增加字符类的数量,通过培训和测试数据集来评估CNN的性能。当它到达100个字符的类时,它显示了90.27%的合理精度。该模型培训了5套不同的100个字符类。最后,实现了434个字符的总精度为82.33%。这种型号外在的类似系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号