【24h】

Metric Rectification of Curved Document Images

机译:弯曲文档图像的度量校正

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose a metric rectification method to restore an image from a single camera-captured document image. The core idea is to construct an isometric image mesh by exploiting the geometry of page surface and camera. Our method uses a general cylindrical surface (GCS) to model the curved page shape. Under a few proper assumptions, the printed horizontal text lines are shown to be line convergent symmetric. This property is then used to constrain the estimation of various model parameters under perspective projection. We also introduce a paraperspective projection to approximate the nonlinear perspective projection. A set of close-form formulas is thus derived for the estimate of GCS directrix and document aspect ratio. Our method provides a straightforward framework for image metric rectification. It is insensitive to camera positions, viewing angles, and the shapes of document pages. To evaluate the proposed method, we implemented comprehensive experiments on both synthetic and real-captured images. The results demonstrate the efficiency of our method. We also carried out a comparative experiment on the public CBDAR2007 data set. The experimental results show that our method outperforms the state-of-the-art methods in terms of OCR accuracy and rectification errors.
机译:在本文中,我们提出了一种度量校正方法,可以从单个相机捕​​获的文档图像中还原图像。核心思想是通过利用页面表面和相机的几何形状来构建等距图像网格。我们的方法使用一般的圆柱面(GCS)来模拟弯曲的页面形状。在一些适当的假设下,打印的水平文本行显示为行收敛对称。然后,此属性用于约束透视投影下各种模型参数的估计。我们还介绍了一个准透视投影来近似非线性透视投影。因此,得出了一组近似形式的公式,用于估算GCS Directrix和文档纵横比。我们的方法为图像度量校正提供了一个简单的框架。它对相机的位置,视角和文档页面的形状不敏感。为了评估所提出的方法,我们对合成图像和实际捕获图像都进行了综合实验。结果证明了我们方法的有效性。我们还对公共CBDAR2007数据集进行了比较实验。实验结果表明,在OCR精度和整流误差方面,我们的方法优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号