首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >A Rectangle Mining Method for Understanding the Semantics of Financial Tables
【24h】

A Rectangle Mining Method for Understanding the Semantics of Financial Tables

机译:一种了解金融表的语义的矩形挖掘方法

获取原文

摘要

Financial statements report crucial information in tables with complex semantic structure, which are desirable, yet challenging, to interpret automatically. For example, in such tables a row of data cells is often explained by the headers of other rows. In a departure from prior art, we propose a rectangle mining framework for understanding complex tables, which considers rectangular regions rather than individual cells or pairs of cells in a table. We instantiate this framework with ReMine, an algorithm for extracting row header semantics of table, and show that it significantly outperforms prior pair-wise classification approaches on two datasets: (i) a set of manually labeled financial tables from multiple companies, and (ii) the ICDAR 2013 Table Competition dataset.
机译:财务报表报告具有复杂语义结构的表格中的重要信息,这是可取的,但具有挑战性的,以自动解释。例如,在这样的表中,通常由其他行的标题解释一行数据单元。在现有技术的偏离中,我们提出了一个矩形挖掘框架,用于了解复杂的表,其考虑矩形区域而不是表中的单个小区或一对细胞。我们将此框架实例化了eMINE,一种用于提取表的行报头语义的算法,并表明它显着优于两个数据集上的先前对分类方法:(i)来自多家公司的一组手动标记的金融表,以及(II) )ICDAR 2013表比赛数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号