首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Full-Page Text Recognition: Learning Where to Start and When to Stop
【24h】

Full-Page Text Recognition: Learning Where to Start and When to Stop

机译:全页文本识别:学习开始和何时停止的地方

获取原文

摘要

Text line detection and localization is a crucial step for full page document analysis, but still suffers from heterogeneity of real life documents. In this paper, we present a new approach for full page text recognition. Localization of the text lines is based on regressions with Fully Convolutional Neural Networks and Multidimensional Long Short-Term Memory as contextual layers. In order to increase the efficiency of this localization method, only the position of the left side of the text lines are predicted. The text recognizer is then in charge of predicting the end of the text to recognize. This method has shown good results for full page text recognition on the highly heterogeneous Maurdor dataset.
机译:文本线路检测和本地化是全页文档分析的关键步骤,但仍然存在现实生活文件的异质性。在本文中,我们提出了一种全新的全页文本识别方法。文本线的本地化基于具有完全卷积神经网络的回归和多维的长短期存储器作为上下文层。为了提高这种定位方法的效率,仅预测文本线的左侧的位置。然后,文本识别器负责预测要识别的文本的末尾。在高度异构Maurdor数据集上,此方法显示出完整页面文本识别的良好结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号