Patch Aggregator for Scene Text Script Identification

机译：修补程序聚合器，用于场景文本脚本识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Script identification in the wild is of great importance in a multi-lingual robust-reading system. The scripts deriving from the same language family share a large set of characters, which makes script identification a fine-grained classification problem. Most existing methods make efforts to learn a single representation that combines the local features by making a weighted average or other clustering methods, which may reduce the discriminatory power of some important parts in each script for the interference of redundant features. In this paper, we present a novel module named Patch Aggregator (PA), which learns a more discriminative representation for script identification by taking into account the prediction scores of local patches. Specifically, we design a CNN-based method consisting of a standard CNN classifier and a PA module. Experiments demonstrate that the proposed PA module brings significant performance improvements over the baseline CNN model, achieving the state-of-the-art results on three benchmark datasets for script identification: SIW-13, CVSI 2015 and RRC-MLT 2017.

机译：在多语言的健壮阅读系统中，野外脚本识别非常重要。源自同一语言家族的脚本共享大量字符，这使得脚本标识成为细粒度的分类问题。大多数现有方法都通过学习加权平均或其他聚类方法来努力学习结合局部特征的单一表示形式，这可能会降低每个脚本中一些重要部分对冗余特征的干扰的区分能力。在本文中，我们提出了一个名为Patch Aggregator（PA）的新颖模块，该模块通过考虑局部补丁的预测得分来学习更具判别性的脚本识别表示。具体来说，我们设计了一个基于CNN的方法，该方法由标准CNN分类器和PA模块组成。实验表明，所提出的PA模块在基准CNN模型上带来了显着的性能提升，在三个用于脚本识别的基准数据集上实现了最先进的结果：SIW-13，CVSI 2015和RRC-MLT 2017。

著录项

来源
《International Conference on Document Analysis and Recognition》|2019年|1077-1083|共7页
会议地点
作者
Changxu Cheng; Qiuhui Huang; Xiang Bai; Bin Feng; Wenyu Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Training; Aggregates; Convolution; Text recognition; Standards; Fuses;

机译：特征提取;训练;聚合;卷积;文本识别;标准;保险丝;
入库时间 2022-08-26 14:34:37

相似文献

外文文献
中文文献
专利

1. Residual attention-based multi-scale script identification in scene text images [J] . Ma Mengkai, Wang Qiu-Feng, Huang Shan, Neurocomputing . 2021,第Jana15期

机译：基于残余的关注的多尺度脚本识别在现场文本图像中
2. Multi-script text versus non-text classification of regions in scene images [J] . Sriman Bowornrat, Schomaker Lambert Journal of visual communication & image representation . 2019,第JULa期

机译：场景图像中区域的多脚本文本与非文本分类
3. Multi-script text versus non-text classification of regions in scene images [J] . Sriman Bowornrat, Schomaker Lambert Journal of visual communication & image representation . 2019,第Jula期

机译：多脚本文本与场景图像中区域的非文本分类
4. Patch Aggregator for Scene Text Script Identification [C] . Changxu Cheng, Qiuhui Huang, Xiang Bai, International Conference on Document Analysis and Recognition . 2019

机译：用于场景文本脚本标识的补丁聚合器
5. THE IDENTIFICATION OF LIFE SCRIPT ELEMENTS BY PERSONS POSSESSING VARYING LEVELS OF TRAINING AND EXPERIENCE IN TRANSACTIONAL ANALYSIS PRINCIPLES AND LIFE SCRIPT THEORY. [D] . PREPURA, WAYNE ANDREW. 1979

机译：在交易分析原理和寿命脚本理论中，通过掌握变化的训练水平和经验的人员来识别寿命脚本元素。
6. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images [O] . Asghar Ali Chandio, Md. Asikuzzaman, Mark Pickering, 2020

机译：草书文本：用于自然场景图像中端到端乌尔都语文本识别的综合数据集
7. Improving patch-based scene text script identification with ensembles of conjoined networks [O] . Gomez, Lluis, Nicolaou, Anguelos, Karatzas, Dimosthenis 2017

机译：使用集合改进基于补丁的场景文本脚本识别联合网络

Patch Aggregator for Scene Text Script Identification

摘要

著录项

相似文献

相关主题

期刊订阅