Generation of Synthetic Images of Full-Text Documents

Abstract

In this paper, we present an algorithm for generating images of full-text documents. Such images can be used to train and evaluate optical character recognition (OCR) models. The algorithm is modular: individual parts can be changed and tuned to generate the desired images. We describe a method for obtaining background images of paper from already digitized documents. We train a Variational Autoencoder as a generative model of these backgrounds, which enables on-the-fly generation of background images similar to the training ones. The module for printing the text uses large text corpora, a font, and suitable positional and brightness noise to obtain believable results. We use Tesseract OCR to compare real-world and generated images and observe that the recognition rates are very similar, which indicates that the synthetic images have a realistic appearance. Furthermore, the mistakes made by the OCR system in both cases are alike. Finally, the system generates a detailed, structured annotation of the synthesized image.
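
The text-printing step and the structured annotation mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the fixed-width glyph model, the pixel sizes, the jitter ranges, and the names `WordAnnotation` and `layout_words` are all assumptions made for illustration. The sketch only computes where each word would be placed and how dark its ink would be, and records that as an annotation; it does not render pixels or generate a background.

```python
import random
from dataclasses import dataclass, asdict


@dataclass
class WordAnnotation:
    """One annotation record per printed word (hypothetical schema)."""
    text: str
    x: int           # left edge in pixels
    y: int           # top edge in pixels
    width: int
    height: int
    brightness: int  # ink gray level, 0 = black, 255 = white


def layout_words(text, page_width=400, margin=20, char_width=8, line_height=16,
                 pos_jitter=1, base_brightness=40, brightness_jitter=15, seed=0):
    """Place words left to right, wrapping at the right margin.

    Every default value here is an assumed, illustrative parameter,
    not a value taken from the paper.
    """
    rng = random.Random(seed)  # seeded so the layout is reproducible
    annotations = []
    x, y = margin, margin
    for word in text.split():
        width = char_width * len(word)
        # Wrap to the next line when the word would cross the right margin.
        if x + width > page_width - margin and x > margin:
            x = margin
            y += line_height
        # Positional noise: shift each word by a few pixels.
        dx = rng.randint(-pos_jitter, pos_jitter)
        dy = rng.randint(-pos_jitter, pos_jitter)
        # Brightness noise: vary the ink gray level, clamped to [0, 255].
        level = base_brightness + rng.randint(-brightness_jitter, brightness_jitter)
        level = max(0, min(255, level))
        annotations.append(
            WordAnnotation(word, x + dx, y + dy, width, line_height, level)
        )
        x += width + char_width  # advance by the word plus one space
    return annotations


sample = ("Such images can be used to train and evaluate "
          "models of optical character recognition")
words = layout_words(sample)
for ann in words[:3]:
    print(asdict(ann))
```

In a full pipeline, records like these would be written out next to the rendered image so that OCR output can be scored against exact word positions.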

Bibliographic details

Source
International Conference on Speech and Computer, 2018, pp. 68-75 (8 pages)
Authors
Lukas Bures; Petr Neduchal; Miroslav Hlavac; Marek Hruz
Original format
PDF
Keywords
Generating images; Character recognition; Computer vision; Machine learning

Similar literature

1. DocCreator: A New Software for Creating Synthetic Ground-Truthed Document Images [J]. Antoine Billy, Nicholas Journet, Boris Mansencal. Journal of Imaging, 2017, No. 4

2. Intensity-based dual model method for generation of synthetic CT images from standard T2-weighted MR images – Generalized technique for four different MR scanners [J]. Lauri Koivula, Mika Kapanen, Tiina Seppälä. Radiotherapy and Oncology: Journal of the European Society for Therapeutic Radiology and Oncology, 2017, No. 3

3. Generation of Synthetic but Visually Realistic Time Series of Cardiac Images Combining a Biophysical Model and Clinical Images [J]. Prakosa A., Sermesant M., Delingette H. IEEE Transactions on Medical Imaging, 2013, No. 1

4. Generation of Synthetic Images of Full-Text Documents [C]. Lukas Bures, Petr Neduchal, Miroslav Hlavac. International Conference on Speech and Computer, 2018

5. Visual Information Retrieval from Historical Document Images = La recherche d’information visuelle à partir d’images de documents historiques [D]. Zhalehpour, Sara. 2018

6. WebMedline: Transforming Medline into a Hypertext Environment with Links to Full-Text Documents [O]. William M. Detmer, Edward H. Shortliffe. 1996

7. Semi-synthetic Document Image Generation Using Texture Mapping on Scanned 3D Document Shapes [O]. Kieu, Van Cuong; Journet, Nicholas; Visani, Muriel. 2013
