首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Bag of Local Convolutional Triplets for Script Identification in Scene Text
【24h】

Bag of Local Convolutional Triplets for Script Identification in Scene Text

机译:用于场景文本中脚本识别的局部卷积三胞胎袋

获取原文

摘要

The increasing interest in scene text reading in multilingual environments raises the need to recognize and distinguish between different writing systems. In this paper, we propose a novel method for script identification in scene text using triplets of local convolutional features in combination with the traditional bag-of-visual-words model. Feature triplets are created by making combinations of descriptors extracted from local patches of the input images using a convolutional neural network. This approach allows us to generate a more descriptive codeword dictionary for the bag-of-visual-words model, as the low discriminative power of weak descriptors is enhanced by other descriptors in a triplet. The proposed method is evaluated on two public benchmark datasets for scene text script identification and a public dataset for script identification in video captions. The experiments demonstrate that our method outperforms the baseline and yields competitive results on all three datasets.
机译:在多语言环境中,对场景文本阅读的兴趣日益浓厚,因此需要识别和区分不同的书写系统。在本文中,我们提出了一种使用局部卷积特征的三元组结合传统的视觉词袋模型在场景文本中进行脚本识别的新方法。通过使用卷积神经网络对从输入图像的局部补丁中提取的描述符进行组合来创建特征三元组。这种方法使我们能够为视觉词袋模型生成更具描述性的代码字字典,因为弱描述符的低判别能力会通过三元组中的其他描述符得到增强。在两个公共基准数据集(用于场景文本脚本识别)和一个公共数据集(用于在视频字幕中标识脚本)上评估了所提出的方法。实验表明,我们的方法优于基线,并且在所有三个数据集上均具有竞争性结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号