
A Database of Artificial Urdu Text in Video Images with Semi-Automatic Text Line Labeling Scheme

Abstract

This paper describes a novel database of video images containing artificial (superimposed) Urdu text, together with a semi-automatic text line labeling scheme. The main objective of this study is to provide the community with a standard dataset and an auto-labeling scheme for the development and evaluation of algorithms for textual-content-based indexing and retrieval systems. We focus specifically on Urdu text, which has attracted growing research interest in recent years. The data set comprises 1000 video images collected from 19 different channels in 5 different categories. An attempt is made to capture the maximum possible variation in the text in terms of size, location, appearance and background. The data set is completely labeled by finding the bounding rectangle of each text occurrence, which facilitates the evaluation of text detection and localization systems. Based on our previous work on text localization, an automatic text labeling scheme is also proposed, and the obtained results are compared with manual labeling. Ground truth data supporting tasks like text recognition and word spotting will be considered in the next version of the data set.
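The abstract says the automatically produced labels are compared with manual labels but does not name the comparison measure. As a rough illustration only, the sketch below assumes each label is an axis-aligned bounding rectangle and scores agreement with intersection-over-union (IoU) and a 0.5 match threshold; the `Box` type, the `match_labels` helper, the greedy pairing and the threshold are all hypothetical choices, not the authors' method.

```python
from typing import List, NamedTuple, Tuple


class Box(NamedTuple):
    """Axis-aligned bounding rectangle in pixel coordinates (hypothetical format)."""
    x: int       # left edge
    y: int       # top edge
    width: int
    height: int


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two rectangles; 0.0 when they do not overlap."""
    ix = max(0, min(a.x + a.width, b.x + b.width) - max(a.x, b.x))
    iy = max(0, min(a.y + a.height, b.y + b.height) - max(a.y, b.y))
    inter = ix * iy
    union = a.width * a.height + b.width * b.height - inter
    return inter / union if union else 0.0


def match_labels(manual: List[Box], automatic: List[Box],
                 threshold: float = 0.5) -> Tuple[float, float]:
    """Greedily pair each manual box with its best unused automatic box.

    A pair counts as a hit when IoU >= threshold (assumed value).
    Returns (precision, recall) of the automatic labels.
    """
    used = set()
    hits = 0
    for m in manual:
        best_j, best_score = None, threshold
        for j, a in enumerate(automatic):
            if j in used:
                continue
            score = iou(m, a)
            if score >= best_score:
                best_j, best_score = j, score
        if best_j is not None:
            used.add(best_j)
            hits += 1
    precision = hits / len(automatic) if automatic else 0.0
    recall = hits / len(manual) if manual else 0.0
    return precision, recall


# Made-up example: two manual text lines, two automatic detections.
manual = [Box(10, 10, 100, 20), Box(10, 50, 80, 20)]
automatic = [Box(12, 11, 100, 20), Box(200, 200, 50, 10)]
precision, recall = match_labels(manual, automatic)
print(precision, recall)
```

In this made-up example one automatic box overlaps a manual box closely and the other overlaps nothing, so one of two detections is correct and one of two manual lines is found.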


Bibliographic details

Source
MMEDIA 2012 | 2012 | 7 pages
Authors
Imran Siddiqi; Ahsen Raza
Original format: PDF
Chinese Library Classification: 73.9542083
Keywords
Data Set; Artificial Urdu Text; Text Detection; Text Localization

