首页> 外文会议>IAPR International Workshop on Document Analysis Systems >Skew Estimation of Sparsely Inscribed Document Fragments
【24h】

Skew Estimation of Sparsely Inscribed Document Fragments

机译:稀疏铭刻文档碎片的倾斜估计

获取原文

摘要

Document analysis is done to analyze entire forms (e.g. intelligent form analysis, table detection) or to describe the layout/structure of a document for further processing. A pre-processing step of document analysis methods is a skew estimation of scanned or photographed documents. Current skew estimation methods require the existence of large text areas, are dependent on the text type and can be limited on a specific angle range. The proposed method is gradient based in combination with a Focused Nearest Neighbor Clustering of interest points and has no limitations regarding the detectable angle range. The upside/down decision is based on statistical analysis of ascenders and descenders. It can be applied to entire documents as well as to document fragments containing only a few words. Results show that the proposed skew estimation is comparable with state-of-the-art methods and outperforms them on a real dataset consisting of 658 snippets.
机译:完成文档分析以分析整个表格(例如智能形式分析,表检测)或描述文档的布局/结构以进行进一步处理。 文档分析方法的预处理步骤是扫描或拍摄文档的偏差估计。 当前的偏斜估计方法需要存在大的文本区域,依赖于文本类型,并且可以限制在特定角度范围内。 所提出的方法是基于渐变基于兴趣点的聚焦最近邻聚类的梯度,并且对可检测角度范围没有限制。 上行/下降决策是基于上升人员和后裔的统计分析。 它可以应用于整个文档以及仅包含几个单词的文件片段。 结果表明,所提出的偏斜估计与最先进的方法相当,并且在由658个片段组成的实际数据集中优于它们。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号