Conference: International Conference on Machine Vision

Generation method of synthetic training data for mobile OCR system


Abstract

This paper addresses one of the fundamental problems of machine learning: acquiring training data. Obtaining enough natural training data is difficult and expensive. In recent years, the use of synthetic images has become increasingly beneficial, since it saves human time and provides a large number of images that would otherwise be difficult to obtain. However, to learn successfully from an artificial dataset, one should try to reduce the gap between the natural and synthetic data distributions. In this paper we describe an algorithm for creating artificial training datasets for OCR systems, using the Russian passport as a case study.
