首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >R-YOLO: A Real-Time Text Detector for Natural Scenes with Arbitrary Rotation
【2h】

R-YOLO: A Real-Time Text Detector for Natural Scenes with Arbitrary Rotation

机译:R-YOLO:用于自然场景的实时文本探测器随意旋转

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Accurate and efficient text detection in natural scenes is a fundamental yet challenging task in computer vision, especially when dealing with arbitrarily-oriented texts. Most contemporary text detection methods are designed to identify horizontal or approximately horizontal text, which cannot satisfy practical detection requirements for various real-world images such as image streams or videos. To address this lacuna, we propose a novel method called Rotational You Only Look Once (R-YOLO), a robust real-time convolutional neural network (CNN) model to detect arbitrarily-oriented texts in natural image scenes. First, a rotated anchor box with angle information is used as the text bounding box over various orientations. Second, features of various scales are extracted from the input image to determine the probability, confidence, and inclined bounding boxes of the text. Finally, Rotational Distance Intersection over Union Non-Maximum Suppression is used to eliminate redundancy and acquire detection results with the highest accuracy. Experiments on benchmark comparison are conducted upon four popular datasets, i.e., ICDAR2015, ICDAR2013, MSRA-TD500, and ICDAR2017-MLT. The results indicate that the proposed R-YOLO method significantly outperforms state-of-the-art methods in terms of detection efficiency while maintaining high accuracy; for example, the proposed R-YOLO method achieves an F-measure of 82.3% at 62.5 fps with 720 p resolution on the ICDAR2015 dataset.
机译:在自然场景中准确和有效的文本检测是计算机愿景中的基本且挑战性的任务,特别是在处理任意导向的文本时。大多数当代文本检测方法旨在识别水平或大约水平的文本,这不能满足各种真实世界图像的实际检测要求,例如图像流或视频。为了解决这个LECUA,我们提出了一种称为旋转的新方法,您只需看一次(R-YOLO),一个强大的实时卷积神经网络(CNN)模型,用于检测自然图像场景中的任意导向文本。首先,具有角度信息的旋转锚盒用作各种方向上的文本边界框。其次,从输入图像中提取各种尺度的特征,以确定文本的概率,置信度和倾斜边界框。最后,使用联盟非最大抑制的旋转距离交叉来消除冗余,并以最高精度获取检测结果。基准比较的实验在四个流行的数据集,即ICDAR2015,ICDAR2013,MSRA-TD500和ICDAR2017-MLT上进行。结果表明,所提出的R-Yolo方法在检测效率方面显着优于最先进的方法,同时保持高精度;例如,所提出的R-Yolo方法在ICDAR2015数据集上实现了62.5 FPS的62.5 FPS的F-Measet 82.3%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号