首页> 外文期刊>Neurocomputing >Learning to predict more accurate text instances for scene text detection
【24h】

Learning to predict more accurate text instances for scene text detection

机译:学习预测场景文本检测的更准确的文本实例

获取原文
获取原文并翻译 | 示例

摘要

At present, multi-oriented text detection methods based on deep neural network have achieved promising performances on various benchmarks. Nevertheless, there are still some difficulties for arbitrary shape text detection, especially for a simple and proper representation of arbitrary shape text instances. In this paper, a pixel-based text detector is proposed to facilitate the representation and prediction of text instances with arbitrary shapes in a simple manner. Firstly, to alleviate the influence of the target vertex sorting and achieve the direct regression of arbitrary shape text instances, the starting-point independent coordinates regression loss is proposed. Furthermore, to predict more accurate text instances, the text instance accuracy loss is proposed as an assistant task to refine the predicted coordinates under the guidance of IoU. To evaluate the effectiveness of our detector, extensive experiments have been carried on public benchmarks which contain arbitrary shape text instances and multi oriented text instances. We obtain 84.8% of F-measure on Total-Text benchmark. The results show that our method can reach state-of-the-art performance.(c) 2021 Elsevier B.V. All rights reserved.
机译:目前,基于深度神经网络的多种文本检测方法在各种基准上取得了有希望的表现。然而,任意形状文本检测仍然存在一些困难,特别是对于任意形状文本实例的简单且正确的表示。在本文中,提出了一种基于像素的文本检测器,以便以简单的方式促进具有任意形状的文本实例的表示和预测。首先,为了减轻目标顶点分类的影响并实现任意形状文本实例的直接回归,提出了起始点独立坐标回归损耗。此外,为了预测更准确的文本实例,提出了文本实例精度损耗作为在iou的指导下优化预测坐标的助理任务。为了评估我们的探测器的有效性,在公共基准上进行了广泛的实验,其中包含任意形状文本实例和多面向文本实例。我们在全文基准上获得84.8%的F测量。结果表明,我们的方法可以达到最先进的性能。(c)2021 Elsevier B.v.保留所有权利。

著录项

  • 来源
    《Neurocomputing》 |2021年第18期|455-463|共9页
  • 作者单位

    Chinese Acad Sci Inst Automat 95 Zhongguancun East Rd Beijing 100190 Peoples R China|Univ Chinese Acad Sci Sch Artificial Intelligence Beijing 100049 Peoples R China;

    Chinese Acad Sci Inst Automat 95 Zhongguancun East Rd Beijing 100190 Peoples R China;

    Chinese Acad Sci Inst Automat 95 Zhongguancun East Rd Beijing 100190 Peoples R China;

    Chinese Acad Sci Inst Automat 95 Zhongguancun East Rd Beijing 100190 Peoples R China;

    Chinese Acad Sci Inst Automat 95 Zhongguancun East Rd Beijing 100190 Peoples R China;

    Chinese Acad Sci Inst Automat 95 Zhongguancun East Rd Beijing 100190 Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Scene text detection; Curved text; Direct regression;

    机译:场景文本检测;弯曲文本;直接回归;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号