DIGITNET: A Deep Handwritten Digit Detection and Recognition Method Using a New Historical Handwritten Digit Dataset

Kusetogullari Huseyin; Yavariabdi Amir; Hall Johan; Lavesson Niklas

首页> 外文期刊>Big Data Research >DIGITNET: A Deep Handwritten Digit Detection and Recognition Method Using a New Historical Handwritten Digit Dataset

【24h】

DIGITNET: A Deep Handwritten Digit Detection and Recognition Method Using a New Historical Handwritten Digit Dataset

机译：Digitnet：使用新的历史手写数字数据集进行深层手写的数字检测和识别方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper introduces a novel deep learning architecture, named DIGITNET, and a large-scale digit dataset, named DIDA, to detect and recognize handwritten digits in historical document images written in the nineteen century. To generate the DIDA dataset, digit images are collected from 100, 000 Swedish handwritten historical document images, which were written by different priests with different handwriting styles. This dataset contains three sub-datasets including single digit, large-scale bounding box annotated multi-digit, and digit string with 250, 000, 25, 000, and 200, 000 samples in RedGreen-Blue (RGB) color spaces, respectively. Moreover, DIDA is used to train the DIGITNET network, which consists of two deep learning architectures, called DIGITNET-dect and DIGITNET-rec, respectively, to isolate digits and recognize digit strings in historical handwritten documents. In DIGITNET-dect architecture, to extract features from digits, three residual units where each residual unit has three convolution neural network structures are used and then a detection strategy based on You Look Only Once (YOLO) algorithm is employed to detect handwritten digits at two different scales. In DIGITNET-rec, the detected isolated digits are passed through 3 different designed Convolutional Neural Network (CNN) architectures and then the classification results of three different CNNs are combined using a voting scheme to recognize digit strings. The proposed model is also trained with various existing handwritten digit datasets and then validated over historical handwritten digit strings. The experimental results show that the proposed architecture trained with DIDA (publicly available from: https://didadataset.io/DIDA) outperforms the state-of-the-art methods. (C) 2020 The Author(s). Published by Elsevier Inc.

机译：本文介绍了一个名为Digitnet的新型深度学习架构，名为Dida的大型数字数据集，以检测和识别在十九世纪写的历史文档图像中的手写数字。要生成DIDA数据集，从1000 000瑞典手写历史文档图像中收集数字图像，这些历史文档图像由不同的牧师用不同的笔迹样式编写。该数据集包含三个子数据集，包括单位数字，大型边界框注释的多位数，以及分别在RedGreen-Blue（RGB）颜色空间中的250,000,25,000和200,000个样本的数字字符串。此外，Dida用于培训DigitNet网络，该网络分别由两个深入的学习架构组成，分别称为DigitNet-Dect和DigitNet-Rec，分离数字并识别历史手写文档中的数字字符串。在Digitnet-Dect架构中，要从数字中提取特征，三个残余单元使用，其中每个残差单元使用三个卷积神经网络结构，然后基于您的检测策略仅用于一次（YOLO）算法用于检测两个手写的数字不同的尺度。在DigitNet-REC中，检测到的隔离位通过3个不同设计的卷积神经网络（CNN）架构，然后使用投票方案组合三种不同CNN的分类结果以识别数字字符串。所提出的模型也接受了各种现有手写的数字数据集，然后通过历史手写数字字符串验证。实验结果表明，拟议的架构与DIDA培训（公开可供选择：https://didadataset.io/dida）优于最先进的方法。（c）2020提交人。 elsevier公司发布

著录项

来源
《Big Data Research》 |2021年第1期|共13页
作者
Kusetogullari Huseyin; Yavariabdi Amir; Hall Johan; Lavesson Niklas;
展开▼
作者单位

Blekinge Inst Technol Dept Comp Sci S-37141 Karlskrona Sweden;

KTO Karatay Univ Dept Mechatron Engn Konya Turkey;

Arkiv Digital Vaxjo Sweden;

Jonkoping Univ Sch Engn Dept Comp Sci SE-55318 Jonkoping Sweden;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Historical handwritten documents; Handwritten digit detection; Ensemble deep learning; Digit string recognition; DIDA handwritten digit dataset;

机译：历史手写文件;手写的数字检测;集成深度学习;数字字符串识别;DIDA手写的数字数据集;
入库时间 2022-08-20 16:57:39

相似文献

外文文献
中文文献
专利

1. Handwritten digit recognition based on ghost imaging with deep learning [J] . Xing He, Sheng-Mei Zhao, Le Wang 中国物理：英文版 . 2021,第005期
2. ARDIS: a Swedish historical handwritten digit dataset [J] . Kusetogullari Huseyin, Yavariabdi Amir, Cheddad Abbas, Neural computing & applications . 2020,第21期

机译：Ardis：瑞典历史手写数字数据集
3. HANDWRITTEN DEVNAGARI DIGIT RECOGNITION: BENCHMARKING ON NEW DATASET [J] . RAJIV KUMAR, KIRAN KUMAR RAVULAKOLLU Journal of Theoretical and Applied Information Technology . 2014,第3期

机译：手动DEVNAGARI数字识别：在新数据集上进行基准测试
4. HANDWRITTEN DEVNAGARI DIGIT RECOGNITION: BENCHMARKING ON NEW DATASET [J] . RAJIV KUMAR, KIRAN KUMAR RAVULAKOLLU Journal of Theoretical and Applied Information Technology . 2014,第3期

机译：手动DEVNAGARI数字识别：在新数据集上进行基准测试
5. Curation of Historical Arabic Handwritten Digit Datasets from Ottoman Population Registers: A Deep Transfer Learning Case Study [C] . Yekta Said Can, M. Erdem Kabadayı IEEE International Conference on Big Data . 2020

机译：奥斯曼人群寄存器的历史阿拉伯手写数字数据集的策划：深度转移学习案例研究
6. Comparison of Search Algorithms in Two-Stage Neural Network Training for Optical Character Recognition of Handwritten Digits [D] . Gilley, Patrik Wayne. 2020

机译：两级神经网络训练中搜索算法的比较，用于手写数字的光学字符识别
7. Deep Convolutional Extreme Learning Machine and Its Application in Handwritten Digit Classification [O] . Shan Pang, Xinyi Yang 2016

机译：深度卷积极限学习机及其在手写数字分类中的应用
8. A Selective Attention-Based Method for Visual Pattern Recognition with Application to Handwritten Digit Recognition and Face Recognition [O] . Albert Ali Salah, Ethem Alpaydin, Lale Akarun 2002

机译：一种基于选择性注意的视觉模式识别方法及其在手写体数字识别和人脸识别中的应用
9. Subspace Classifiers in Recognition of Handwritten Digits. Acta PolytechnicaScandinavica Mathematics, Computing and Management in Engineering Series No. 84 [R] . Laaksonen, J. 1997

机译：手写数字识别中的子空间分类器。 acta polytechnicascandinavica数学，工程计算和管理系列第84号

DIGITNET: A Deep Handwritten Digit Detection and Recognition Method Using a New Historical Handwritten Digit Dataset

摘要

著录项

相似文献

相关主题

期刊订阅