首页> 外文会议>International Conference on Document Analysis and Recognition >Integrated algorithms for newspaper page decomposition and article tracking
【24h】

Integrated algorithms for newspaper page decomposition and article tracking

机译:报纸页分解和文章跟踪的集成算法

获取原文

摘要

The conversion of newspaper pages into digital resources is an important task that greatly contributes to the preservation of and access to newspaper archives. In this paper, an integrated methodology is presented for segmenting newspaper pages and identifying newspaper articles. In the first stage, a succession of image processing and document analysis algorithms is employed for segmenting newspaper page images into various objects (text, images and drawings, titles). A rule based approach is subsequently applied to the objects identified during the page segmentation phase for reconstructing individual articles. Experimental results, obtained from a large testbed of old newspaper issues, are presented which clearly demonstrate the applicability of our integrated approach to successful newspaper page segmentation and identification of newspaper articles.
机译:报纸页面转换为数字资源是一项重要的任务,极大地有助于保存和获取报纸档案。本文介绍了分段报纸页面和识别报纸文章的综合方法。在第一阶段中,使用图像处理和文档分析算法用于将报纸页面图像分割成各种对象(文本,图像和图形,标题)。随后将基于规则的方法应用于在页面分段阶段期间识别的对象以重建单个物品。提出了从旧报纸问题的大型检测器获得的实验结果,这清楚地证明了我们综合方法对成功的报纸页面细分和报纸文章的识别的适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号