...
首页> 外文期刊>Pattern Analysis and Applications >Detection of artificial and scene text in images and video frames
【24h】

Detection of artificial and scene text in images and video frames

机译:检测图像和视频帧中的人工和场景文本

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Textual information in images and video frames constitutes a valuable source of high-level semantics for multimedia indexing and retrieval systems. Text detection is the most crucial step in a multimedia text extraction system and although it has been extensively studied the past decade still, it does not exist a generic architecture that would work for artificial and scene text in multimedia content. In this paper we propose a system for text detection of both artificial and scene text in images and video frames. The system is based on a machine learning stage which uses an Random Forest classifier and a highly discriminative feature set produced by using a new texture operator called Multilevel Adaptive Color edge Local Binary Pattern (MACeLBP). MACeLBP describes the spatial distribution of color edges in multiple adaptive levels of contrast. Then, a gradient-based algorithm is applied to achieve distinction among text lines as well as refinement in the localization of the text lines. The whole algorithm is situated in a multiresolution framework to achieve invariance to scale for the detection of text lines. Finally, an optional connected-component step segments text lines into words based on the distances between the resulting components. The experimental results are produced by applying a concise evaluation methodology and prove the superior performance achieved by the proposed text detection system for artificial and scene text in images and video frames.
机译:图像和视频帧中的文本信息构成了多媒体索引和检索系统高级语义的宝贵来源。文本检测是多媒体文本提取系统中最关键的一步,尽管在过去的十年中已经对其进行了广泛的研究,但它不存在适用于多媒体内容中的人工文本和场景文本的通用体系结构。在本文中,我们提出了一种用于图像和视频帧中的人工文本和场景文本的文本检测系统。该系统基于机器学习阶段,该阶段使用随机森林分类器和通过使用称为多级自适应彩色边缘局部二进制图案(MACeLBP)的新纹理运算符产生的高度区分性特征集。 MACeLBP描述了多个自适应对比度级别中颜色边缘的空间分布。然后,应用基于梯度的算法来实现文本行之间的区分以及文本行定位中的细化。整个算法位于多分辨率框架中,以实现不变性以进行缩放以检测文本行。最后,一个可选的“连接组件”步骤会根据结果组件之间的距离将文本行分割为单词。实验结果是通过使用简洁的评估方法得出的,并证明了所提出的文本检测系统对图像和视频帧中的人工文本和场景文本具有卓越的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号