首页> 外文会议>International Conference on Frontiers in Handwriting Recognition >Visual Perception of Unitary Elements for Layout Analysis of Unconstrained Documents in Heterogeneous Databases
【24h】

Visual Perception of Unitary Elements for Layout Analysis of Unconstrained Documents in Heterogeneous Databases

机译:异构数据库中不受约束文档布局分析的统一元素的视觉感知

获取原文

摘要

The document layout analysis is a complex task in the context of heterogeneous documents. It is still a challenging problem. In this paper, we present our contribution for the layout analysis competition of the international Maurdor Campaign. Our method is based on a grammatical description of the content of elements. It consists in iteratively finding and then removing the most structuring elements of documents. This method is based on notions of perceptive vision: a combination of points of view of the document, and the analysis of salient contents. Our description is generic enough to deal with a very wide range of heterogeneous documents. This method obtained the second place in Run 2 of Maurdor Campaign (on 1000 documents), and the best results in terms of pixel labeling for text blocs and graphic regions.
机译:在异构文档的上下文中,文档布局分析是一项复杂的任务。这仍然是一个具有挑战性的问题。在本文中,我们将为国际Maurdor运动的布局分析比赛做出贡献。我们的方法基于元素内容的语法描述。它包括迭代查找并删除文档中最结构化的元素。该方法基于感知视觉的概念:文档观点的结合以及对主要内容的分析。我们的描述足够通用,可以处理非常广泛的异构文档。该方法在Maurdor Campaign的运行2中排名第二(在1000个文档中),并且在文本块和图形区域的像素标记方面获得了最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号