首页> 外文会议>IEEE National Computing Colleges Conference >End-to-End Optical Character Recognition for Bengali Handwritten Words
【24h】

End-to-End Optical Character Recognition for Bengali Handwritten Words

机译:Bengali手写单词的端到端光学字符识别

获取原文

摘要

Optical character recognition (OCR) is a process of converting analogue documents into digital using document images. Currently, many commercial and non-commercial OCR systems exist for both handwritten and printed copies for different languages. Despite this, very few works are available in case of recognising Bengali words. Among them, most of the works focused on OCR of printed Bengali characters. This paper introduces an end-to-end OCR system for Bengali language. The proposed architecture implements an end to end strategy that recognises handwritten Bengali words from handwritten word images. We experiment with popular convolutional neural network (CNN) architectures, including DenseNet, Xception, NASNet, and MobileNet to build the OCR architecture. Further, we experiment with two different recurrent neural networks (RNN) methods, LSTM and GRU. We evaluate the proposed architecture using BanglaWritting dataset, which is a peer-reviewed Bengali handwritten image dataset. The proposed method achieves 0.091 character error rate and 0.273 word error rate performed using DenseNet121 model with GRU recurrent layer.
机译:光学字符识别(OCR)是使用文档图像将模拟文档转换为数字图像的过程。目前,不同语言的手写和印刷副本都存在许多商业和非商业OCR系统。尽管如此,在识别孟加拉语时,可以使用很少的作品。其中,大多数作品专注于印刷孟加拉人物的OCR。本文介绍了孟加拉语的端到端OCR系统。拟议的架构实现了结束到结束策略,识别手写的字图像中的手写孟加拉语。我们试验热门的卷积神经网络(CNN)架构,包括Densenet,Xcepion,NASnet和MobileNet来构建OCR架构。此外,我们试验两种不同的经常性神经网络(RNN)方法,LSTM和GRU。我们使用Banglawritt DataSet评估所提出的架构,该数据集是一个对等待的孟加拉手写图像数据集。所提出的方法实现了使用Densenet121模型进行了0.091个字符的误差率和0.273字误差率,使用GRU复发层进行了Gru复发层。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号