首页> 外文会议>International Conference on Document Analysis and Recognition >Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval
【24h】

Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval

机译:使用文档图像检索自动生成相机捕获的文档的地面真相

获取原文

摘要

In this paper a novel method for automatic ground truth generation of camera captured document images is proposed. Currently, no dataset is available for camera captured documents. It is very difficult to build these datasets manually, as it is very laborious and costly. The proposed method is fully automatic, allowing building the very large scale (i.e., millions of images) labeled camera captured documents dataset, without any human intervention. Evaluation of samples generated by the proposed approach shows that 99.98% of the images are correctly labeled. Novelty of the proposed approach lies in the use of document image retrieval for automatic labeling, especially for camera captured documents, which contain different distortions specific to camera, e.g., blur, occlusion, perspective distortion, etc.
机译:本文提出了一种新的方法来自动生成相机捕获的文档图像的地面真相。当前,没有数据集可用于相机捕获的文档。手动构建这些数据集非常困难,因为这既费力又费钱。所提出的方法是全自动的,允许在没有任何人工干预的情况下构建非常大规模的(即,数百万张图像)带标签的摄像机捕获的文档数据集。对通过该方法生成的样本进行的评估显示,正确标记了99.98%的图像。提出的方法的新颖性在于将文档图像检索用于自动标记,尤其是对于相机捕获的文档,其中包含针对相机的不同变形,例如模糊,遮挡,透视变形等。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号