首页> 外文会议>International Conference on Graphic and Image Processing >Text Extraction from Images in the Wild Using the Viola-Jones Algorithm
【24h】

Text Extraction from Images in the Wild Using the Viola-Jones Algorithm

机译:使用中提琴jones算法从野外图像中提取图片

获取原文

摘要

Text Localization and extraction is an important issue in modern applications of computer vision. Applications such as reading and translating texts in the wild or from videos are among the many applications that can benefit results of this field. In this work, we adopt the well-known Viola-Jones algorithm to enable text extraction and localization from images in the wild. The Viola-Jones is an efficient, and a fast image-processing algorithm originally used for face detection. Based on some resemblance between text and face detection tasks in the wild, we have modified the viola-jones to detect regions of interest where text may be localized. In the proposed approach, some modification to the HAAR like features and a semi-automatic process of data set generating and manipulation were presented to train the algorithm. A process of sliding windows with different sizes have been used to scan the image for individual letters and letter clusters existence. A post processing step is used in order to combine the detected letters into words and to remove false positives. The novelty of the presented approach is using the strengths of a modified Viola-Jones algorithm to identify many different objects representing different letters and clusters of similar letters and later combine them into words of varying lengths. Impressive results were obtained on the ICDAR contest data sets.
机译:文本本地化和提取是计算机愿景现代应用中的一个重要问题。诸如狂野或来自视频中的读取和翻译文本的应用程序是许多可以利用该领域的结果的应用程序。在这项工作中,我们采用了知名的中提琴jones算法,使文本提取和从野外图像中的定位。 Viola-Jones是一个有效的,并且最初用于面部检测的快速图像处理算法。基于野外文本和面部检测任务之间的一些相似性,我们修改了Viola-jones来检测文本可能本地化的感兴趣区域。在提出的方法中,提出了对哈尔的一些修改,如特征和数据集生成和操纵的半自动过程以训练算法。使用不同尺寸滑动窗口的过程已被用于扫描图像以获取单个字母和字母集群存在。使用后处理步骤以将检测到的字母组合成单词并删除误报。所提出的方法的新颖性是使用修改的中提琴算法的优势来识别代表不同字母和类似字母的不同字母和群集的许多不同对象,然后将它们与变化长度的单词组合成不同长度的单词。在icdar竞赛数据集上获得了令人印象深刻的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号