Automatic Word Ground Truth Generation for Camera Captured Documents

Sheraz AHMED; Koichi KISE; Masakazu IWAMURA; Marcus LIWICKI; Andreas DENGEL

首页> 外文期刊>電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding >Automatic Word Ground Truth Generation for Camera Captured Documents

【24h】

Automatic Word Ground Truth Generation for Camera Captured Documents

机译：相机捕获文档的自动单词地面真相生成

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

A database for camera captured documents is useful to train OCRs to obtain better performance. However, no dataset exists for camera captured documents because it is very laborious and costly to build these datasets manually. In this paper, a fully automatic approach allowing building the very large scale (i.e., millions of images) labeled camera captured documents dataset is proposed. The proposed approach does not require any human intervention in labeling. Evaluation of samples generated by the -proposed approach shows that more than 97% of the images are correctly labeled. Novelty of the proposed approach lies in the use of document image retrieval for automatic labeling, especially for camera captured documents, which contain different distortions specific to camera, e.g., blur, perspective distortion, etc.

机译：相机捕获的文档数据库对于训练OCR以获得更好的性能很有用。但是，不存在相机捕获的文档的数据集，因为手动构建这些数据集非常费力且昂贵。在本文中，提出了一种全自动方法，该方法可以构建非常大规模（即数百万张图像）的带标签的摄像机捕获文档数据集。提议的方法不需要任何人为干预。通过提议的方法生成的样本的评估显示，正确地标记了超过97％的图像。所提出方法的新颖之处在于将文档图像检索用于自动标记，尤其是对于相机捕获的文档，该文档图像包含针对相机的不同变形，例如模糊，透视变形等。

著录项

来源
《電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding》 |2012年第495期|共6页
作者
Sheraz AHMED; Koichi KISE; Masakazu IWAMURA; Marcus LIWICKI; Andreas DENGEL;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类图像通信、多媒体通信;
关键词
Ground truth; Locally Likely Arrangement Hashing (LLAH); Camera Captured Documents; Perspective distortion; Blur;

机译：基本事实;局部可能的哈希排序（LLAH）;相机捕获的文档;透视失真;模糊;

相似文献

外文文献
中文文献
专利

1. Automatic Word Ground Truth Generation for Camera Captured Documents [J] . Sheraz AHMED, Koichi KISE, Masakazu IWAMURA, 電子情報通信学会技術研究報告 . 2013,第495期

机译：用于相机捕获的文档的自动Word Ground Truth生成
2. Automatic localization and extraction of tables from handheld mobile-camera captured handwritten document images [J] . Amarnath R., Sindhushree G. S., Nagabhushan P., Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2019,第3期

机译：手持式移动摄像机捕获的手写文档图像自动本地化和提取表格
3. The use of Gabor features for semi-automatically generated polyon-based ground truth of historical document images [J] . Wei Hao, Seuret Mathias, Liwicki Marcus, Literary & linguistic computing . 2017,第aprasuppla1期

机译：使用Gabor功能半自动生成基于Polyon的历史文档图像地面真实情况
4. Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval [C] . Ahmed Sheraz, Kise Koichi, Iwamura Masakazu, International Conference on Document Analysis and Recognition . 2013

机译：使用文档图像检索自动生成相机捕获的文档的地面真相
5. Registration and categorization of camera captured documents. [D] . Edupuganti, Venkata Gopal. 2012

机译：摄像机捕获文件的注册和分类。
6. Semi-automatic ground truth generation using unsupervised clustering and limited manual labeling: Application to handwritten character recognition [O] . Szilárd Vajda, Yves Rangoni, Hubert Cecotti -1

机译：使用无监督聚类和有限手动标记的半自动地面真相生成：在手写字符识别中的应用
7. Automatic Ground-truth Generation for Document Image Analysis and Understanding [O] . Héroux, Pierre, Barbu, Eugen, Adam, Sébastien, 2007

机译：自动生成地面真相，用于文档图像分析和理解

Automatic Word Ground Truth Generation for Camera Captured Documents

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅