首页> 外文会议>International Conference on Innovations in Information Technology >Empirical analysis of generalization through augmentation for classifying images of vernacular handwritten texts
【24h】

Empirical analysis of generalization through augmentation for classifying images of vernacular handwritten texts

机译:通过增强对白话手写文本图像的推广概括的实证分析

获取原文

摘要

Handwritten image recognition [1] is a challenging problem that has seen tremendous research efforts in recent years from the computer vision community. Although many deep learning-based classifications algorithms are studied in the literature for handwritten image recognition [2], most of the works are concentrated on a few widely used benchmark datasets [3]. Hence, it is difficult to understand how well a certain model generalizes across different optical characters with vast complexity levels. The existing research works for classification from handwritten image datasets are primarily focused on developing better model architectures, better training procedures, better regularization methods such as Image Augmentation, Dropout, Dropblock, etc. [4–6], hard example mining techniques [7], better Loss Functions such as Focal loss [8], Weighted Cross Entropy loss, etc. In this paper, we focus on image augmentation techniques along with an example mining technique to establish a robust classification baseline for handwritten datasets using a novel combination of augmentation methods and online hard example mining. Through our experiments, we show the combination of advanced image augmentation techniques best suited for achieving the highest macro averaged recall score and highlight a few important trends therein.
机译:手写的图像识别[1]是一个具有挑战性的问题,近年来从计算机视觉社区近年来看过巨大的研究工作。虽然在文献中研究了许多基于深度学习的分类算法,但是手写图像识别的文献[2],大多数作品集中在少数广泛使用的基准数据集中[3]。因此,很难理解某种模型在不同的光学字符上概括,具有巨大的复杂程度。现有的研究作品从手写图像数据集分类主要集中在开发更好的模型架构,更好的培训程序,更好的正则化方法,如图像增强,丢弃,丢弃等[4-6],硬示例挖掘技术[7] ,更好的损失功能,如焦丢失[8],加权交叉熵损失等。在本文中,我们专注于图像增强技术以及一个示例挖掘技术,用于使用增强的新颖组合来建立手写数据集的强大分类基线方法和在线硬示例挖掘。通过我们的实验,我们展示了最适合实现最高宏观平均召回得分的先进图像增强技术的组合,并突出其中一些重要趋势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号