...
首页> 外文期刊>Journal of Real-Time Image Processing >Fast RT-LoG operator for scene text detection
【24h】

Fast RT-LoG operator for scene text detection

机译:用于场景文本检测的快速RT-log运算符

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes a new real-time Laplacian of Gaussian (RT-LoG) operator for scene text detection. This method takes advantage of the Gaussian kernel distribution in the spatial/scale-space domains and kernel decomposition with the box filtering method. Two levels of optimization are given. The first level of optimization within the spatial domain is obtained by box mutualization. The second level of optimization within the spatial/scale-space domains is performed using a mixed method for box selection. The proposed RT-LoG operator is evaluated on the ICDAR2017 RRC-MLT dataset in terms of robustness and time processing. The results are compared with the state-of-the-art real-time operators for scene text detection. The proposed operator appears as the top performance with the best trade-off between robustness and time processing. The proposed operator can support approximately 30 frames per second (FPS) up to the Quad-HD resolution on a regular CPU architecture with a low-level latency. In addition, the proposed operator can support the full pipeline for scene text detection. Our system is competitive with the top accurate systems of the literature while processing with a difference of two orders of magnitude in term of processing resources.
机译:本文为现场文本检测提出了高斯(RT-Log)运算符的新实时拉普拉斯。该方法利用了空间/刻度空间域的高斯内核分布以及盒式滤波方法的内核分解。给出了两种级别的优化。空间域内的第一级优化是通过盒子相互化的获得。使用混合方法进行盒子选择来执行空间/尺度空间域内的第二级优化水平。在鲁棒性和时间处理方面,在ICDAR2017 RRC-MLT数据集上评估了所提出的RT-Log运算符。结果与现场文本检测的最先进的实时操作员进行比较。建议的操作员作为最佳性能,具有鲁棒性和时间处理之间的最佳权衡。所提出的运算符可以通过低级别延迟,通过常规CPU架构上的Quad-HD分辨率提供大约30帧(FPS)(FPS)。此外,所提出的操作员可以支持用于场景文本检测的完整管道。我们的系统与文献的顶级精确系统具有竞争力,同时在处理资源期间具有两个数量级的差异。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号