...
首页> 外文期刊>Computer Vision, IET >Method for unconstrained text detection in natural scene image
【24h】

Method for unconstrained text detection in natural scene image

机译:自然场景图像中无约束文本检测的方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Text detection in natural scene images is an important prerequisite for many content-based multimedia understanding applications. The authors present a simple and effective text detection method in natural scene image. Firstly, MSERs are extracted by the V-MSER algorithm from channels of , , , , and , as component candidates. Since text is composed of character candidates, the authors design an MRF model to exploit the relationship between characters. Secondly, in order to filter out non-text components, they design a set of two-layers filtering scheme: most of the non-text components can be filtered by the first layer of the filtering scheme; the second layer filtering scheme is an AdaBoost classifier, which is trained by the features of compactness, horizontal variance and vertical variance, and aspect ratio. Then, only four simple features are adopted to generate component pairs. Finally, according to the orientation similarity of the component pairs, component pairs which have roughly the same orientation are merged into text lines. The proposed method is evaluated on two public datasets: ICDAR 2011 and MSRA-TD500. It achieves 82.94 and 75% -measure, respectively. Especially, the experimental results, on their URMQ_LHASA-TD220 dataset which contains 220 images for multi-orientation and multi-language text lines evaluation, show that the proposed method is general for detecting scene text lines in different languages.
机译:自然场景图像中的文本检测是许多基于内容的多媒体理解应用程序的重要前提。作者提出了一种简单有效的自然场景图像文本检测方法。首先,通过V-MSER算法从,,,和的通道中提取MSER作为候选分量。由于文本由候选字符组成,因此作者设计了一个MRF模型来利用字符之间的关系。其次,为了过滤掉非文本成分,他们设计了一套两层过滤方案:大多数非文本成分可以被过滤方案的第一层过滤;第二层过滤方案是AdaBoost分类器,它通过紧凑性,水平方差和垂直方差以及长宽比等特征进行训练。然后,仅采用四个简单特征来生成组件对。最后,根据组件对的方向相似性,将具有大致相同方向的组件对合并为文本行。在两个公共数据集上评估了所提出的方法:ICDAR 2011和MSRA-TD500。它分别达到82.94和75%的度量。尤其是,在其URMQ_LHASA-TD220数据集上的实验结果包含220幅图像,用于多方位和多语言文本行评估,结果表明,该方法可用于检测不同语言的场景文本行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号