首页> 外文期刊>Expert Systems with Application >A robust system for document layout analysis using multilevel homogeneity structure
【24h】

A robust system for document layout analysis using multilevel homogeneity structure

机译:使用多层次同质性结构进行文档布局分析的强大系统

获取原文
获取原文并翻译 | 示例

摘要

One of the difficulties in the understanding of document images is document layout analysis, which is the first step in document image modeling. In this paper, a robust system for which a multilevel-homogeneity structure is used in accordance with a hybrid methodology is proposed to deal with this problem. Our system consists of the following three main stages: classification, segmentation, and refinement and labeling. Different from other page segmentation methods, the proposed system includes an efficient algorithm to detect table regions in document images. Besides, to create an effective application, the proposed system is designed to work with a variety of document languages. The proposed method was tested with the ICDAR2015 competition (RDCL-2015) and three other published datasets in different languages. The results of these tests show that the accuracy of proposed system is superior to the previous methods. (C) 2017 Elsevier Ltd. All rights reserved.
机译:理解文档图像的困难之一是文档布局分析,这是文档图像建模的第一步。在本文中,提出了一种鲁棒的系统,该系统根据混合方法使用了多级同质性结构来解决此问题。我们的系统包括以下三个主要阶段:分类,细分以及优化和标记。与其他页面分割方法不同,该系统包括一种有效的算法来检测文档图像中的表格区域。此外,为了创建有效的应用程序,提出的系统旨在与多种文档语言一起使用。在ICDAR2015竞赛(RDCL-2015)和其他三种使用不同语言的已发布数据集中测试了该方法。这些测试结果表明,所提出的系统的精度优于以前的方法。 (C)2017 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号