OCFormer: A Transformer-Based Model For Arabic Handwritten Text Recognition

机译：ocformer：阿拉伯语手写文本识别的基于变压器的模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Optical Character Recognition (OCR) of Arabic historical documents is a challenging task. The reason being the complexity of the layout and the highly variant typography. Nonetheless, in recent years, with the rise of Deep learning, significant progress has been made in historical OCR; in both layout recognition and segmentation, and also in character recognition. The only downside is the limited advancements dedicated to the Arabic language, notably the handwritten text. In this paper, we present an OCR approach that utilizes state-of-theart Deep learning techniques for the Arabic language. We built a custom dataset of obfuscated and noisy images to imitate the noise in historical Arabic documents, with a collection of 30 million images paired with their ground truth. The model utilizes both page segmentation and line segmentation techniques to enhance the resultant transcription. The model is complex enough for transcribing handwritten manuscripts. In addition, the model can detect and transcribe documents that contain Arabic diacritics. The model attained a CER of 0.0727, a WER of 0.0829, and a SER of 0.10.

机译：阿拉伯语历史文献的光学字符识别（OCR）是一个具有挑战性的任务。原因是布局的复杂性和高度变体排版。尽管如此，在近年来，随着深度学习的兴起，历史OCR中取得了重大进展;在布局识别和分段中，以及在字符识别中。唯一的缺点是致力于阿拉伯语的有限进步，特别是手写文本。在本文中，我们介绍了一种用于阿拉伯语的最终学习技术的OCR方法。我们构建了一个混淆和嘈杂的图像的自定义数据集，以模仿历史阿拉伯文文件中的噪音，其中包含3000万个图像与他们的实际真相配对。该模型利用页面分段和线分割技术来增强所得转录。该模型足以转换手写手稿的复杂性。此外，该模型可以检测和转录含有阿拉伯语变量的文档。该模型达到0.0727的CER，WER为0.0829，SER为0.10。

著录项

来源
《International Mobile, Intelligent, and Ubiquitous Computing Conference》|2021年|182-186|共5页
会议地点
作者
Aly Mostafa; Omar Mohamed; Ali Ashraf; Ahmed Elbehery; Salma Jamal; Ghada Khoriba; Amr S. Ghoneim;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Deep learning; Image segmentation; Text recognition; Computational modeling; Layout; Ubiquitous computing; Optical imaging;

机译：深入学习;图像分割;文本识别;计算建模;布局;无处不在的计算;光学成像;

相似文献

外文文献
中文文献
专利

1. Recognition of Cursive Arabic Handwritten Text Using Embedded Training Based on Hidden Markov Models [J] . Rabi Mouhcine, Amrouch Mustapha, Mahani Zouhir International Journal of Pattern Recognition and Artificial Intelligence . 2018,第1期

机译：基于隐马尔可夫模型的嵌入式训练对草书阿拉伯手写文本的识别
2. Offline handwritten Arabic cursive text recognition using Hidden Markov Models and re-ranking [J] . Jawad H AlKhateeb, Jinchang Ren, Jianmin Jiang, Pattern recognition letters . 2011,第8期

机译：使用隐马尔可夫模型进行离线手写阿拉伯草书文本识别并重新排序
3. A Holistic Model for Recognition of Handwritten Arabic Text Based on the Local Binary Pattern Technique [J] . Atallah AL-Shatnawi, Faisal Al-Saqqar, Safa’a Alhusban International Journal of Interactive Mobile Technologies . 2020,第16期

机译：基于局部二元模式技术的手写阿拉伯文文本的整体模型
4. Contribution on Character Modelling for Handwritten Arabic Text Recognition [C] . Anis Mezghani, Faten Kallel, Slim Kanoun, International Afro-European Conference for Industrial Advancement . 2017

机译：作者：王莹，王莹
5. Arabic handwritten text recognition using structural and syntactic pattern attributes. [D] . Parvez, Mohammad Tanvir. 2010

机译：使用结构和句法模式属性的阿拉伯语手写文本识别。
6. ASM Based Synthesis of Handwritten Arabic Text Pages [O] . Laslo Dinges, Ayoub Al-Hamadi, Moftah Elzobi, 2015

机译：基于ASM的阿拉伯语手写文本页面综合
7. Modeling and training options for handwritten Arabic text recognition [O] . Ahmad Irfan 2016

机译：手写阿拉伯文字识别的建模和培训选项

OCFormer: A Transformer-Based Model For Arabic Handwritten Text Recognition

摘要

著录项

相似文献

相关主题

期刊订阅