首页> 外文会议>International Conference on Document Analysis and Recognition >Challenges in End-to-End Neural Scientific Table Recognition
【24h】

Challenges in End-to-End Neural Scientific Table Recognition

机译:端到端神经科学表格识别中的挑战

获取原文

摘要

In recent years, end-to-end trained neural models have been applied successfully to various optical character recognition (OCR) tasks. However, the same level of success has not yet been achieved in end-to-end neural scientific table recognition, which involves multi-row/multi-column structures and math formulas in cells. In this paper, we take a step forward to full end-to-end scientific table recognition by constructing a large dataset consisting of 450K table images paired with corresponding LaTeX sources. We apply a popular attentional encoder-decoder model to this dataset and show the potential of end-to-end trained neural systems, as well as associated challenges.
机译:近年来,端到端训练的神经模型已成功应用于各种光学字符识别(OCR)任务。但是,在端到端神经科学表格识别中还没有达到相同的成功水平,这种识别涉及单元格中的多行/多列结构和数学公式。在本文中,我们通过构建由450K表格图像和相应的LaTeX源配对而成的大型数据集,朝着全面的端到端科学表格识别迈出了一步。我们将流行的注意力编码器-解码器模型应用于此数据集,并展示了端到端训练的神经系统的潜力以及相关的挑战。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号