【24h】

Synthetic Dataset Generation for Text Recognition with Generative Adversarial Networks

机译:生成对抗网络的文本识别综合数据集生成

获取原文

摘要

Automated text recognition is used in autonomous driving systems, search engines, document analysis, and manyother applications. There are many techniques to extract text information from scanned documents, but text recognitionfrom arbitrary images is a much harder task. Recently suggested deep learning approaches have demonstrated highqualityresults, but they require a huge amount of data to achieve them. The process of collecting and labelling trainingdata to train a deep learning network is costly. In this paper, we suggest an approach for automatic dataset generation fortext recognition for arbitrary languages. We use a generative adversarial network structure, which is adapted to generatereadable and clear text looking naturally on the image background. We evaluate our approach using SegLink andTextboxes++ text localization models, which were trained on examples generated by SynthText and by variations of ourmethod. The comparison showed the superiority of our method on a subset of the ICDAR 2017 dataset for English andArabic languages.
机译:自动文本识别可用于自动驾驶系统,搜索引擎,文档分析以及许多其他领域。 其他应用程序。有很多技术可以从扫描的文档中提取文本信息,但是文本识别 从任意图像中提取图像是一项艰巨的任务。最近建议的深度学习方法已经证明了高质量 结果,但它们需要大量数据才能实现。收集和标记培训的过程 训练深度学习网络的数据非常昂贵。在本文中,我们提出了一种用于自动生成数据集的方法 任意语言的文本识别。我们使用生成式对抗网络结构,该结构适于生成 清晰可读的文本,自然而然地出现在图像背景上。我们使用SegLink和 Textboxes ++文本本地化模型,该模型针对SynthText生成的示例以及我们的变体进行了培训 方法。比较结果显示了我们的方法在ICDAR 2017数据集的英语和英语的子集上的优势 阿拉伯语。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号