Generation of Synthetic Images of Full-Text Documents

机译：全文文件的综合图像的产生

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present an algorithm for generating images of full-text documents. Such images can be used to train and evaluate models of optical character recognition. The algorithm is modular, individual parts can be changed and tweaked to generate desired images. We describe a method for obtaining background images of paper from already digitalized documents. We use a Variational Autoencoder to train a generative model of these backgrounds enabling the generation of similar background images as the training ones on the fly. The module for printing the text uses large text corpora, font, and suitable positional and brightness noise to obtain believable results. We use Tesseract OCR to compare the real world and generated images and observe that the recognition rate is very similar indicating the proper appearance of the synthetic images. Furthermore, the mistakes made by the OCR system in both cases are alike. Finally, the system generates detailed, structured annotation of the synthesized image.

机译：在本文中，我们提出了一种用于生成全文文档图像的算法。这些图像可用于训练和评估光学字符识别的模型。算法是模块化的，可以改变各个部件并调整以产生所需的图像。我们描述了一种从已经数字化文档获得纸张背景图像的方法。我们使用变形式AutoEncoder来培训这些背景的生成模型，使得类似背景图像的产生作为训练。打印文本的模块使用大型文本语料库，字体和合适的位置和亮度噪声来获得可信结果。我们使用TESSERACT OCR来比较现实世界和生成的图像并观察到识别率非常相似，表明合成图像的适当外观。此外，OCR系统在这两种情况下的错误都是相似的。最后，系统生成合成图像的详细，结构化注释。

著录项

来源
《International Conference on Speech and Computer》|2018年|xv 791 p.|共8页
会议地点
作者
Lukas Bures; Petr Neduchal; Miroslav Hlavac; Marek Hruz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
Generating images; Character recognition Computer vision; Machine learning;

机译：生成图像;字符识别计算机视觉;机器学习;

相似文献

外文文献
中文文献
专利

1. DocCreator: A New Software for Creating Synthetic Ground-Truthed Document Images [J] . Antoine Billy, Nicholas Journet, Boris Mansencal, Journal of Imaging . 2017,第4期

机译：DocCreator：用于创建合成的真实地面文档图像的新软件
2. Intensity-based dual model method for generation of synthetic CT images from standard T2-weighted MR images – Generalized technique for four different MR scanners [J] . Lauri Koivula, Mika Kapanen, Tiina Sepp?l?, Radiotherapy and oncology: Journal of the European Society for Therapeutic Radiology and Oncology . 2017,第3期

机译：基于强度的双模型方法，用于生成标准T2加权MR图像的合成CT图像 - 四种不同MR扫描仪的通用技术
3. Generation of Synthetic but Visually Realistic Time Series of Cardiac Images Combining a Biophysical Model and Clinical Images [J] . Prakosa A., Sermesant M., Delingette H., Medical Imaging, IEEE Transactions on . 2013,第1期

机译：结合生物物理模型和临床图像的心脏图像合成但视觉逼真的时间序列的生成
4. Generation of Synthetic Images of Full-Text Documents [C] . Lukas Bures, Petr Neduchal, Miroslav Hlavac, International Conference on speech and computer . 2018

机译：全文文档合成图像的生成
5. Visual Information Retrieval from Historical Document Images =La recherche d’information visuelle à partir d’images de documents historiques [D] . Zhalehpour, Sara. 2018

机译：从历史文档检索的视觉信息检索=搜索历史文档的视觉信息
6. WebMedline: Transforming Medline into a Hypertext Environment with Links to Full-Text Documents [O] . William M. Detmer, Edward H. Shortliffe 1996

机译：WebMedline：通过链接到全文文档将Medline转换为超文本环境
7. Semi-synthetic Document Image Generation Using Texture Mapping on Scanned 3D Document Shapes [O] . Kieu, Van Cuong, Journet, Nicholas, Visani, Muriel, 2013

机译：在扫描的3D文档形状上使用纹理映射生成半合成文档图像

Generation of Synthetic Images of Full-Text Documents

摘要

著录项

相似文献

相关主题

期刊订阅