TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation

机译：TK-Text：通过实例分割的多形场景文本检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Benefit from the development of deep neural networks, scene text detectors have progressed rapidly over the past few years and achieved outstanding performance on several standard benchmarks. However, most existing methods adopt quadrilateral bounding boxes to represent texts, which are usually inadequate to deal with multi-shaped texts such as the curved ones. To keep consist detection performance on both quadrilateral and curved texts, we present a novel representation, i.e., text kernel, for multi-shaped texts. On the basis of text kernel, we propose a simple yet effective scene text detection method, named as TK-Text. The proposed method consists of three steps, namely text-context-aware network, segmentation map generation and text kernel based post-clustering. During text-context-aware network, we construct a segmentation-based network to extract feature map from natural scene images, which are further enhanced with text context information extracted from an attention scheme TKAB. In segmentation map generation, text kernels and rough boundaries of text instances are segmented based on the enhanced feature map. Finally, rough text instances are gradually refined to generate accurate text instances by performing clustering based on text kernel. Experiments on public benchmarks including SCUT-CTW1500, ICDAR 2015 and ICDAR 2017 MLT demonstrate that the proposed method achieves competitive detection performance comparing with the existing methods.

机译：得益于深度神经网络的发展，场景文本检测器在过去几年中发展迅速，并在多个标准基准上均表现出色。然而，大多数现有方法采用四边形边界框来表示文本，这通常不足以处理诸如弯曲文本之类的多种形状的文本。为了在四边形和弯曲文本上保持一致性检测性能，我们提出了一种新颖的表示形式，即用于多形文本的文本核。基于文本内核，我们提出了一种简单而有效的场景文本检测方法，称为TK-Text。所提出的方法包括三个步骤，即文本上下文感知网络，分割图生成和基于文本内核的后聚类。在感知文本上下文的网络中，我们构建了一个基于分段的网络以从自然场景图像中提取特征图，并通过从关注方案TKAB中提取的文本上下文信息进一步增强了该特征图。在分割图生成中，基于增强的特征图对文本核和文本实例的粗略边界进行分割。最后，通过基于文本内核执行聚类，逐步完善粗略的文本实例以生成准确的文本实例。在包括SCUT-CTW1500，ICDAR 2015和ICDAR 2017 MLT在内的公开基准测试中，该方法与现有方法相比具有竞争优势。

著录项

来源
《International Conference on Multimedia Modeling》|2020年|201-213|共13页
会议地点
作者
Xiaoge Song; Yirui Wu; Wenhai Wang; Tong Lu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Multi-shaped scene text detection; Instance segmentation; Text-context-aware network; Text kernel;

机译：多形场景文本检测;实例分割;文本上下文感知网络;文字内核;
入库时间 2022-08-26 13:55:04

相似文献

外文文献
中文文献
专利

1. Instance Segmentation Network With Self-Distillation for Scene Text Detection [J] . Yang Peng, Yang Guowei, Gong Xun, Quality Control, Transactions . 2020,第期

机译：实例分割网络，具有自蒸馏场景文本检测
2. Learning to predict more accurate text instances for scene text detection [J] . Li Xiaoqian, Liu Jie, Zhang Guixuan, Neurocomputing . 2021,第Auga18期

机译：学习预测场景文本检测的更准确的文本实例
3. Natural Scene Text Detection and Segmentation Using Phase-Based Regions and Character Retrieval [J] . Julia Diaz-Escobar, Vitaly Kober Mathematical Problems in Engineering: Theory, Methods and Applications . 2020,第1期

机译：使用基于阶段地区和字符检索的自然场景文本检测和分割
4. TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation [C] . Xiaoge Song, Yirui Wu, Wenhai Wang, International Conference on Multimedia Modeling . 2020

机译：TK-Text：多重场景文本检测通过实例分段
5. 3D Object Detection, Instance Segmentation and Classification from 3D Range and 2D Color Images [D] . Shen, Xiaoke. 2021

机译：3D对象检测，实例分段和3D范围和2D彩色图像的分类
6. CLoDSA: a tool for augmentation in classification localization detection semantic segmentation and instance segmentation tasks [O] . Ángela Casado-García, César Domínguez, Manuel García-Domínguez, 2019

机译：CLoDSA：扩展分类本地化检测语义分割和实例分割任务的工具
7. Survey on Text Detection, Segmentation and Recognition from a Natural Scene Images [O] . Uma B. Karanje, Rahul Dagade 2015

机译：自然场景图像文本检测，分割与识别研究综述

TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation

摘要

著录项

相似文献

相关主题

期刊订阅