Impact of Pre-Processing on Recognition of Cursive Video Text

机译：预处理对草书视频文本识别的影响

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recognition of text appearing in videos offers a number of interesting applications including retrieval systems, generation of user alerts on keywords and news summarization systems. Thanks to the recent advancements in deep learning, high text recognition rates have been reported in the recent years. An important step in training such systems is the pre-processing of images for effective feature learning and classification. This study investigates the impact of pre-processing on recognition of cursive video text using Urdu as a case study. The recognition engine relies on a combination of convolutional and long short-term memory networks followed by a connectionist temporal classification layer for sequence alignment. The system is fed with gray scale text line images directly as well as by segmenting the text from background using various thresholding techniques. Experimental study on a dataset of 12,000 text lines in cursive Urdu text reveals that appropriately preprocessing the text line images significantly improves the recognition rates.

机译：识别视频中出现的文本提供了许多有趣的应用程序，包括检索系统，关键字用户警报生成和新闻摘要系统。由于深度学习方面的最新进展，近年来已报道了很高的文本识别率。训练此类系统的重要一步是对图像进行预处理，以进行有效的特征学习和分类。本研究以乌尔都语为案例，研究了预处理对草书视频文本识别的影响。识别引擎依赖于卷积和长短期记忆网络的组合，其后是连接主义者的时间分类层，用于序列比对。该系统可以直接获得灰度文本行图像，也可以使用各种阈值技术将文本从背景中分割出来。对草书乌尔都语文本中的12,000个文本行的数据集进行的实验研究表明，对文本行图像进行适当的预处理可以显着提高识别率。

著录项

来源
《Iberian conference on pattern recognition and image analysis》|2019年|565-576|共12页
会议地点
作者
Ali Mirza; Imran Siddiqi; Syed Ghulam Mustufa; Mazahir Hussain;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Cursive video text; Binarization; Convolutional neural networks (CNNs); Long short-term memory networks (LSTMs);

机译：草书视频文字;二值化;卷积神经网络（CNN）;长短期记忆网络（LSTM）;

相似文献

外文文献
中文文献
专利

1. Detection and recognition of cursive text from video frames [J] . Ali Mirza, Ossama Zeshan, Muhammad Atif, EURASIP journal on image and video processing . 2020,第1期

机译：从视频帧中检测和识别法学文本
2. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images [J] . Asghar Ali Chandio, Md. Asikuzzaman, Mark Pickering, Data in Brief . 2020,第3期

机译：Cursive-Text：自然场景图像中的端到端核心文本识别的全面数据集
3. Recognition of Cursive Arabic Handwritten Text Using Embedded Training Based on Hidden Markov Models [J] . Rabi Mouhcine, Amrouch Mustapha, Mahani Zouhir International Journal of Pattern Recognition and Artificial Intelligence . 2018,第1期

机译：基于隐马尔可夫模型的嵌入式训练对草书阿拉伯手写文本的识别
4. Impact of Pre-Processing on Recognition of Cursive Video Text [C] . Ali Mirza, Imran Siddiqi, Syed Ghulam Mustufa, Iberian conference on pattern recognition and image analysis . 2019

机译：预处理对卷积视频文本识别的影响
5. Novel algorithms for video text extraction with application to license plate recognition. [D] . Chen, Minya. 2004

机译：用于视频文本提取的新算法，并应用于车牌识别。
6. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images [O] . Asghar Ali Chandio, Md. Asikuzzaman, Mark Pickering, 2020

机译：草书文本：用于自然场景图像中端到端乌尔都语文本识别的综合数据集
7. Detection and recognition of cursive text from video frames [O] . Ali Mirza, Ossama Zeshan, Muhammad Atif, 2020

机译：从视频帧中检测和识别法学文本

Impact of Pre-Processing on Recognition of Cursive Video Text

摘要

著录项

相似文献

相关主题

期刊订阅