首页> 外文期刊>Knowledge-Based Systems >Stacking ensemble model of deep learning and its application to Persian/Arabic handwritten digits recognition
【24h】

Stacking ensemble model of deep learning and its application to Persian/Arabic handwritten digits recognition

机译:深度学习堆放模型及其在波斯/阿拉伯语手写数字识别中的应用模型

获取原文
获取原文并翻译 | 示例
           

摘要

One of the challenges in recognizing handwritten texts is the individual style of writing. There is the structural similarity of the different digits to each other in writing. Along with the mentioned challenge, these similarities in the text may increase and make it difficult to correctly recognize digits and numbers. In this paper, a new model for recognizing handwritten digits is presented. The proposed model is a stacking ensemble classifier. This classifier is based on the convolutional neural network (CNN) and the bidirectional long-short term memory (BLSTM).Another innovation of the model is the use of the probability vector of images class as the input of the meta-classifier layer. One of the strengths of BLSTM is the ability to learn arrays and vectors; therefore, from a technical point of view, considering the output probability vector of the first model as the input of the meta-classifier (BLSTM) improves the accuracy of the deep learning model.The reason for using stacking ensemble classification is the sameness of the main body of some Persian/Arabic digits (e.g., "2, 3, and 4"). Also, the style the author writes makes classes that are not similar in a structure similar to each other, which causes errors incorrect recognition. This model helps to recognize the correct set of input digits by examining the structure of similarities.To achieve a reliable result in the face of this challenge, this model has been tested on a large Persian/Arabic dataset to cover a wide range of writing styles from different individuals. The dataset has a total of 102,352 data which 60,000 of them are for training data and 20,000 of them are for test data in ten classes of digits that are used in this paper.The result of using this database is to improve the recognition performance of these challenging digits. In examining the dataset presented by the model, the accuracy rate of the training set was 99.98%. And the sample accuracy rate in the test set was 99.39%. That is, compared to experimenting with the convolutional neural network and other researches, the rate has increased. (All codes available at http://web.nit.ac.ir/-h.omranpour/.) (C) 2021 Elsevier B.V. All rights reserved.
机译:识别手写文本的挑战之一是个人写作风格。写作中彼此的不同数字的结构相似性。随着提到的挑战,文本中的这些相似之处可能会增加并使其难以正确识别数字和数字。在本文中,提出了一种用于识别手写数字的新模型。所提出的模型是堆叠集合分类器。该分类器基于卷积神经网络(CNN)和双向长短短期存储器(BLSTM)。该模型的其他创新是使用图像类的概率向量作为元分类器层的输入。 BLSTM的一个优势是学习阵列和载体的能力;因此,从技术角度来看,考虑到第一模型的输出概率向量作为元分类器(BLSTM)的输入提高了深度学习模型的准确性。使用堆叠集合分类的原因是一些波斯/阿拉伯语数字的主体(例如,“2,3和4”)。此外,作者写入的样式使得在与彼此类似的结构中不相似的类,这导致错误错误识别。该模型通过检查相似性的结构,有助于识别正确的输入数字集。在面对这一挑战方面获得可靠的结果,该模型已在大型波斯/阿拉伯语数据集中进行测试,以涵盖各种写作风格来自不同的人。该数据集总共有102,352个数据,其中60,000个用于训练数据,其中20,000个是在本文中使用的十个数字中的测试数据。使用此数据库的结果是提高这些数据的识别性能具有挑战性的数字。在检查模型呈现的数据集时,培训集的准确率为99.98%。测试集中的样本精度率为99.39%。也就是说,与尝试卷积神经网络和其他研究相比,速率增加了。 (所有代码在http://web.nit.ac.ir/-h.omranpour/。)(c)2021 elestvier b.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号