首页> 外文会议>International Conference on Document Analysis and Recognition >Patch Aggregator for Scene Text Script Identification
【24h】

Patch Aggregator for Scene Text Script Identification

机译:修补程序聚合器,用于场景文本脚本识别

获取原文

摘要

Script identification in the wild is of great importance in a multi-lingual robust-reading system. The scripts deriving from the same language family share a large set of characters, which makes script identification a fine-grained classification problem. Most existing methods make efforts to learn a single representation that combines the local features by making a weighted average or other clustering methods, which may reduce the discriminatory power of some important parts in each script for the interference of redundant features. In this paper, we present a novel module named Patch Aggregator (PA), which learns a more discriminative representation for script identification by taking into account the prediction scores of local patches. Specifically, we design a CNN-based method consisting of a standard CNN classifier and a PA module. Experiments demonstrate that the proposed PA module brings significant performance improvements over the baseline CNN model, achieving the state-of-the-art results on three benchmark datasets for script identification: SIW-13, CVSI 2015 and RRC-MLT 2017.
机译:在多语言的健壮阅读系统中,野外脚本识别非常重要。源自同一语言家族的脚本共享大量字符,这使得脚本标识成为细粒度的分类问题。大多数现有方法都通过学习加权平均或其他聚类方法来努力学习结合局部特征的单一表示形式,这可能会降低每个脚本中一些重要部分对冗余特征的干扰的区分能力。在本文中,我们提出了一个名为Patch Aggregator(PA)的新颖模块,该模块通过考虑局部补丁的预测得分来学习更具判别性的脚本识别表示。具体来说,我们设计了一个基于CNN的方法,该方法由标准CNN分类器和PA模块组成。实验表明,所提出的PA模块在基准CNN模型上带来了显着的性能提升,在三个用于脚本识别的基准数据集上实现了最先进的结果:SIW-13,CVSI 2015和RRC-MLT 2017。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号