GROUND TRUTH GENERATION FROM SCANNED DOCUMENTS

Abstract

A plurality of electronic documents, each comprising one or more document pages, is received. First position markers, second position markers, and page identifiers are inserted into the pages. The plurality of electronic documents is printed, thereby generating a printed corpus comprising a plurality of printed documents. The plurality of printed documents is scanned, thereby generating a scanned corpus comprising a plurality of scanned images. Scanning frame positions of the first and the second position markers are detected. The detected scanning frame positions and the page positions are then used to define affine transformations between the plurality of scanned images and the corresponding document pages. The affine transformations are applied to the plurality of scanned images to align them with the corresponding document pages of the plurality of electronic documents.
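
The alignment step can be illustrated with a minimal Python sketch. It is not the patent's implementation. It assumes exactly two markers per page, which determines only a similarity transform (rotation, uniform scale, translation), a special case of an affine transformation; a general affine fit would need at least three non-collinear point pairs. The function names and the coordinates in the usage example are hypothetical.

```python
# Sketch: estimate a scan-to-page transform from two marker correspondences.
# Assumption: two markers fix a 4-parameter similarity transform, which is a
# restricted affine transform. Points are treated as complex numbers x + iy,
# so the transform is z' = a * z + b with complex a (rotation + scale) and b.

def similarity_from_two_markers(scan_a, scan_b, page_a, page_b):
    """Return (a, b) such that page = a * scan + b for both marker pairs."""
    s0, s1 = complex(*scan_a), complex(*scan_b)
    p0, p1 = complex(*page_a), complex(*page_b)
    if s0 == s1:
        raise ValueError("the two scanned marker positions must differ")
    a = (p1 - p0) / (s1 - s0)   # rotation and uniform scale
    b = p0 - a * s0             # translation
    return a, b


def to_affine_matrix(a, b):
    """Expand z' = a*z + b into a 2x3 affine matrix [[m00, m01, tx], [m10, m11, ty]]."""
    return [[a.real, -a.imag, b.real],
            [a.imag, a.real, b.imag]]


def apply_affine(matrix, point):
    """Map one (x, y) point from scan coordinates to page coordinates."""
    x, y = point
    (m00, m01, tx), (m10, m11, ty) = matrix
    return (m00 * x + m01 * y + tx, m10 * x + m11 * y + ty)


# Hypothetical usage: marker positions on the page and as detected in a scan
# that was rotated by 90 degrees, scaled by 2, and shifted.
page_marker_1, page_marker_2 = (10.0, 10.0), (200.0, 280.0)
scan_marker_1, scan_marker_2 = (-15.0, 23.0), (-555.0, 403.0)

a, b = similarity_from_two_markers(scan_marker_1, scan_marker_2,
                                   page_marker_1, page_marker_2)
scan_to_page = to_affine_matrix(a, b)

# Any other scanned point can now be mapped back into page coordinates.
aligned = apply_affine(scan_to_page, (-95.0, 203.0))
```

In practice the same matrix would be used to resample every pixel of the scanned image, so that text regions known from the electronic page can be transferred to the scan as ground truth labels.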