首页> 外文会议>Iberian conference on pattern recognition and image analysis >Impact of Pre-Processing on Recognition of Cursive Video Text
【24h】

Impact of Pre-Processing on Recognition of Cursive Video Text

机译:预处理对草书视频文本识别的影响

获取原文

摘要

Recognition of text appearing in videos offers a number of interesting applications including retrieval systems, generation of user alerts on keywords and news summarization systems. Thanks to the recent advancements in deep learning, high text recognition rates have been reported in the recent years. An important step in training such systems is the pre-processing of images for effective feature learning and classification. This study investigates the impact of pre-processing on recognition of cursive video text using Urdu as a case study. The recognition engine relies on a combination of convolutional and long short-term memory networks followed by a connectionist temporal classification layer for sequence alignment. The system is fed with gray scale text line images directly as well as by segmenting the text from background using various thresholding techniques. Experimental study on a dataset of 12,000 text lines in cursive Urdu text reveals that appropriately preprocessing the text line images significantly improves the recognition rates.
机译:识别视频中出现的文本提供了许多有趣的应用程序,包括检索系统,关键字用户警报生成和新闻摘要系统。由于深度学习方面的最新进展,近年来已报道了很高的文本识别率。训练此类系统的重要一步是对图像进行预处理,以进行有效的特征学习和分类。本研究以乌尔都语为案例,研究了预处理对草书视频文本识别的影响。识别引擎依赖于卷积和长短期记忆网络的组合,其后是连接主义者的时间分类层,用于序列比对。该系统可以直接获得灰度文本行图像,也可以使用各种阈值技术将文本从背景中分割出来。对草书乌尔都语文本中的12,000个文本行的数据集进行的实验研究表明,对文本行图像进行适当的预处理可以显着提高识别率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号