首页> 外文期刊>Journal of ambient intelligence and humanized computing >Language identification from multi-lingual scene text images: a CNN based classifier ensemble approach
【24h】

Language identification from multi-lingual scene text images: a CNN based classifier ensemble approach

机译:来自多语言场景文本图像的语言识别:基于CNN的分类器集合方法

获取原文
获取原文并翻译 | 示例
           

摘要

Since the past two decades, detecting text regions in complex natural images has emerged as a problem of great interest for the research fraternity. This is because these regions of interest serve as source of information that can be utilized for various purposes. However, these regions may contain texts in multiple languages. Hence, identifying the corresponding language of a detected scene text becomes important for further information processing. Language identification of the text, captured in a wild, is an extremely challenging research field in the domain of scene text recognition. In this paper, a deep learning-based classifier combination approach is proposed to solve the problem of language identification from multi-lingual scene text images. In this work, a minimalist Convolutional Neural Network architecture is used as the base model. Five variants of an input image-three different channels of RGB color model (i.e. R for red, G for green and B for blue) along with RGB itself, and grayscale image are passed through the base model separately. The outcomes of these five models are combined using the classifier combination approaches based on sum rule and product rule. Performances of the proposed model have been evaluated on some standard datasets like KAIST and MLe2e as well as in-house multi-lingual scent text dataset. From the experimental results, it has been observed that the proposed model outperforms some state-of-the-art methods considered here for comparison.
机译:自过去的二十年以来,检测复杂自然图像中的文本区域被出现为对研究兄弟会的极大兴趣的问题。这是因为这些兴趣区域是可以用于各种目的的信息来源。但是,这些区域可能包含多种语言中的文本。因此,识别检测到的场景文本的相应语言对于进一步的信息处理变得重要。在野外捕获的文本的语言识别,是场景文本识别领域的一个极具挑战性的研究领域。在本文中,提出了一种深度学习的分类器组合方法,以解决多语言场景文本图像的语言识别问题。在这项工作中,将极简主义卷积神经网络架构用作基础模型。输入图像的五个变体 - 三种不同通道的RGB颜色模型(即红色,G的红色,G为蓝色的R为B为蓝色)以及RGB本身和灰度图像分别通过基础模型。使用基于SUM规则和产品规则的分类器组合方法相结合这五种模型的结果。已在kaist和mle2e等一些标准数据集以及内部多语言香料文本数据集中进行了评估了所提出的模型的性能。从实验结果中,已经观察到所提出的模型优于这里考虑的一些最先进的方法进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号