首页> 外文会议>Iberian Conference on Pattern Recognition and Image Analysis >The Diagonal Split: A Pre-segmentation Step for Page Layout Analysis and Classification
【24h】

The Diagonal Split: A Pre-segmentation Step for Page Layout Analysis and Classification

机译:对角线拆分:页面布局分析和分类的预分割步骤

获取原文

摘要

Document classification is an important task in all the processes related to document storage and retrieval. In the case of complex documents, structural features are needed to achieve a correct classification. Unfortunately, physical layout analysis is error prone. In this paper we present a pre-segmentation step based on a divide & conquer strategy that can be used to improve the page segmentation results, independently of the segmentation algorithm used. This pre-segmentation step is evaluated in classification and retrieval using the selective CRLA algorithm for layout segmentation together with a clustering based on the voronoi area diagram, and tested on two different databases, MARG and Girona Archives.
机译:文档分类是与文档存储和检索相关的所有进程中的重要任务。在复杂的文件的情况下,需要结构特征来实现正确的分类。不幸的是,物理布局分析易于出错。在本文中,我们基于分割和征服策略的预分割步骤,其可用于改善页面分段结果,独立于所用的分割算法。使用基于voronoi区域图的群集,在分类和检索中评估该预分割步骤,以及用于基于voronoi区域图的聚类,并在两个不同的数据库,Marg和Girona档案上进行测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号