首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >A Rectangle Mining Method for Understanding the Semantics of Financial Tables
【24h】

A Rectangle Mining Method for Understanding the Semantics of Financial Tables

机译:理解财务表语义的矩形挖掘方法

获取原文

摘要

Financial statements report crucial information in tables with complex semantic structure, which are desirable, yet challenging, to interpret automatically. For example, in such tables a row of data cells is often explained by the headers of other rows. In a departure from prior art, we propose a rectangle mining framework for understanding complex tables, which considers rectangular regions rather than individual cells or pairs of cells in a table. We instantiate this framework with ReMine, an algorithm for extracting row header semantics of table, and show that it significantly outperforms prior pair-wise classification approaches on two datasets: (i) a set of manually labeled financial tables from multiple companies, and (ii) the ICDAR 2013 Table Competition dataset.
机译:财务报表在具有复杂语义结构的表格中报告关键信息,这是自动解释所需要的,但又具有挑战性。例如,在这样的表中,一行数据单元通常由其他行的标题解释。与现有技术不同,我们提出了一种用于理解复杂表的矩形挖掘框架,该框架考虑了矩形区域,而不是表中的单个单元格或单元格对。我们使用ReMine实例化此框架,该算法用于提取表格的行标题语义,并显示它在两个数据集上明显优于以前的成对分类方法:(i)一组来自多家公司的手动标记的财务表格,以及(ii )的ICDAR 2013表格竞赛数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号