首页> 外文会议>International Conference on Frontiers in Handwriting Recognition >A Study of Data Augmentation for Handwritten Character Recognition using Deep Learning
【24h】

A Study of Data Augmentation for Handwritten Character Recognition using Deep Learning

机译:基于深度学习的手写字符识别数据扩充研究

获取原文

摘要

While convolutional neural networks have made significant achievements in the field of handwriting recognition in recent years, large amounts of training data are required to obtain satisfactory results. To prepare large amounts of image data for training without increased labor, there is a way of increasing the number of images by applying general image processing methods, so-called data augmentation. However, it is difficult to generate character images like those written by different people and to overcome the problems related to the lack of training data by using conventional data augmentation methods. In this paper, we propose a method of acquiring the probability distribution of the features related to the character structure and generating character images of various handwritings using the probability distribution. The proposed method obtains statistical character structure models composed of probability distributions of strokes by learning from character image data. By generating strokes based on the probability distribution of each stroke and assembling them into a character, it becomes possible to generate character images of various handwriting samples not influenced by the original images. In the comparative experiments of handwritten character recognition with a convolutional neural network, good results could be obtained using not only conventional data augmentation methods but also the proposed method together.
机译:尽管近年来卷积神经网络在手写识别领域取得了重大成就,但需要大量的训练数据才能获得令人满意的结果。为了在不增加劳动的情况下准备用于训练的大量图像数据,存在一种通过应用一般的图像处理方法(所谓的数据增强)来增加图像数量的方法。然而,难以通过使用常规的数据增强方法来生成像由不同人所写的字符图像那样的字符图像,并且难以克服与缺少训练数据有关的问题。在本文中,我们提出了一种获取与字符结构有关的特征的概率分布并使用该概率分布生成各种笔迹的字符图像的方法。通过从字符图像数据中学习,该方法获得了由笔划的概率分布组成的统计字符结构模型。通过基于每个笔划的概率分布生成笔划并将它们组装为字符,可以生成不受原始图像影响的各种笔迹样本的字符图像。在使用卷积神经网络进行手写字符识别的比较实验中,不仅可以使用常规数据增强方法,而且可以将所提出的方法一起使用,可以获得良好的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号