首页> 外文会议>IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference >A Multi-Level Feature Fusion Network For Scene Text Detection with Text Attention Mechanism
【24h】

A Multi-Level Feature Fusion Network For Scene Text Detection with Text Attention Mechanism

机译:用于场景文本检测的多级别融合网络,文本注意力机制

获取原文

摘要

To solve the problems of missed text detection and inaccurate text region location in natural scene text detection, a multi-scale and multi-level feature fusion network with attention mechanism is proposed. This method uses the Mask RCNN as the basic framework and improve the backbone ResNet with deformable convolution and rectangular pooling, which extracts multi-level features including global-level, word-level, and character-level, which can extract more comprehensive and richer text feature information. Besides, this paper proposes a region proposal network with text attention mechanism. Experimental results show that the algorithm model effectively extract more useful feature information, significantly improves the recall and accuracy of text detection compared with the current existing methods, and can be applied to actual text detection tasks.
机译:为了解决自然场景文本检测中错过的文本检测和不准确的文本区域位置的问题,提出了一种具有注意机制的多尺度和多级别的功能融合网络。 此方法使用蒙版RCNN作为基本框架,并改进具有可变形卷积和矩形池的骨干reset,其中提取包括全局级别,字级和字符级别的多级功能,可以提取更全面和更丰富的文本 功能信息。 此外,本文提出了一种具有文本关注机制的地区提案网络。 实验结果表明,算法模型有效提取更有用的特征信息,显着提高了与当前现有方法相比的文本检测的召回和准确性,并且可以应用于实际的文本检测任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号