首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Robust, Simple Page Segmentation Using Hybrid Convolutional MDLSTM Networks
【24h】

Robust, Simple Page Segmentation Using Hybrid Convolutional MDLSTM Networks

机译:使用混合卷积MDLSTM网络进行鲁棒,简单的页面分割

获取原文

摘要

Analyzing and segmenting scanned documents is an important step in optical character recognition. The problem is difficult because of the complexity of 2D layouts, the small tolerance of segmentation errors in the output, and the relatively small amount of labeled training data available. Traditional approaches have relied on a combination of sophisticated geometric algorithms, domain knowledge, heuristics, and carefully tuned parameters. This paper describes the use of deep neural networks, in particular a combination of convolutional and multidimensional LSTM networks, for document image and demonstrates that relatively simple networks are capable of fast, reliable text line segmentation and document layout analysis even on complex and noisy inputs, without manual parameter tuning or heuristics. The method is easily adaptable to new datasets by retraining and an open source implementation is available.
机译:分析和分割扫描的文档是光学字符识别的重要步骤。由于2D布局的复杂性,输出中分割错误的容忍度较小以及可用的标记训练数据量相对较少,因此该问题很难解决。传统方法依赖于复杂的几何算法,领域知识,启发法和精心调整的参数的组合。本文介绍了深度神经网络(尤其是卷积和多维LSTM网络的组合)在文档图像中的使用,并展示了相对简单的网络即使在复杂且嘈杂的输入下也能够进行快速,可靠的文本行分割和文档布局分析,无需手动参数调整或试探法。通过重新培训,该方法很容易适应新的数据集,并且可以使用开源实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号