首页> 外文期刊>American Journal of Engineering Research >Digital Image Processing Analysis using Matlab
【24h】

Digital Image Processing Analysis using Matlab

机译:使用Matlab进行数字图像处理分析

获取原文
           

摘要

The intelligent analysis of video data is currently in wide demand because a video is a major source of sensory data in our lives. Text is a prominen t and direct source of information in video, while the recent surveys of text detection and recognition in imagery focus mainly on text extraction from scene images. Here, this paper presents a comprehen sive survey of text detection, tracking, and recognition in video with three major contributions. First, a generic framework is proposed for video text extraction that uniformly describes detection, tracking, recognition, and their relations and interactions. Second, within this framework, a variety of methods, systems, and evaluation protocols of video text extraction are summarized, compared, and analyzed. Existing text tracking techniques, tracking-based detection and recognition techniques are specifically highlighted. Third, related applications, prominent challenges, and future directions for video text extraction (especially from scene videos and web videos) are also thoroughly discussed. To this aim, a supervised DNN is trained to project the input samples into a discriminative feature space, in which the blur type can be easily classified. Then, for each blur type, the proposed GRNN estimates the blur parameters with very high accuracy. Experiments demonstrate the effectiveness of the proposed method in several tasks with better or competitive results compared with the state of the art on two standard image data sets.
机译:由于视频是我们生活中感官数据的主要来源,因此目前对视频数据的智能分析提出了广泛的要求。文本是视频中主要的信息来源,而最近对图像中文本检测和识别的调查主要集中在从场景图像中提取文本。在此,本文对视频中的文本检测,跟踪和识别进行了全面的调查,并做出了三个主要贡献。首先,提出了用于视频文本提取的通用框架,该框架统一描述了检测,跟踪,识别及其关系和交互作用。其次,在此框架内,总结,比较和分析了多种视频文本提取方法,系统和评估协议。特别强调了现有的文本跟踪技术,基于跟踪的检测和识别技术。第三,还详细讨论了视频文本提取(尤其是从场景视频和网络视频)的相关应用,突出的挑战和未来的发展方向。为此,对受监督的DNN进行了训练,以将输入样本投影到可区分特征空间中,在该特征空间中可以轻松分类模糊类型。然后,对于每种模糊类型,提出的GRNN都可以非常高精度地估计模糊参数。实验证明,与两个标准图像数据集上的最新技术相比,该方法在多项任务中具有更好或更具竞争性的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号