...
首页> 外文期刊>Neurocomputing >TabCellNet: Deep learning-based tabular cell structure detection
【24h】

TabCellNet: Deep learning-based tabular cell structure detection

机译:TabcellNet:基于深度学习的表格单元结构检测

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

There is an increasing demand for automated document processing techniques as the volume of electronic component documents increase. This is most prevalent in the supply chain optimization sector where vast amount of documents need to be processed and is time consuming and prone to error. Detection of tables and table structures serves as a crucial step to automate document processing. While table detection is a well investigated problem, tabular structure detection is more complex, and requires further improvements. To address this, this study proposes a deep learning model that focuses on high precision tabular cell structure detection. The proposed model creates a benchmark for the ICDAR2013 dataset cell structure with comparison to the previous state of the art table detection models as well as proposing alternative models. Our methodology approaches improving table structure detection through the detection of cells instead of row and columns for better generalization capabilities for heterogeneous table structures. Our proposed model advances prior models by improving major parts of the detection pipeline, mainly the two-stage detector, backbone, backbone architecture, and non maximum-suppression (NMS). TabCellNet consists of Hybrid Task Cascade (HTC) with Combinational Backbone Network (CBNet), dual ResNeXt101 and Soft-NMS to achieve a precision of 89.2% and recall of 98.7% on the hand annotated ICDAR2013 cell structure dataset.(c) 2021 Elsevier B.V. All rights reserved.
机译:随着电子元件文档的数量增加,对自动化文件处理技术的需求越来越大。这在供应链优化扇区中最普遍存在,其中需要处理大量文档,并且耗时且易于错误。检测表和表结构用作自动化文档处理的重要步骤。虽然表检测是一个良好的研究问题,但表格结构检测更复杂,需要进一步改进。为了解决这个问题,本研究提出了一种深入学习模型,专注于高精度的表格电池结构检测。所提出的模型为ICDAR2013数据集单元结构创建了基准,与先前的艺术表检测模型以及提出替代模型的比较。我们的方法学通过检测单元而不是行和列来改善表结构检测,以便为异构表结构更好地呈现出更好的泛化能力。我们所提出的模型通过改进检测管道的主要部分,主要是两级探测器,骨干,骨干架构和非最大抑制(NMS)来推进现有模型。 Tabcellnet由混合任务级联(HTC)组成,采用组合骨干网(CBNET),双resnext101和Soft-NMS,实现了89.2%的精度,并在手中召回了98.7%的注​​释ICDAR2013单元结构数据集。(c)2021 Elsevier BV版权所有。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号