首页> 外文会议> >Integrated algorithms for newspaper page decomposition and article tracking
【24h】

Integrated algorithms for newspaper page decomposition and article tracking

机译:报纸页面分解和文章跟踪的集成算法

获取原文

摘要

The conversion of newspaper pages into digital resources is an important task that greatly contributes to the preservation of and access to newspaper archives. In this paper, an integrated methodology is presented for segmenting newspaper pages and identifying newspaper articles. In the first stage, a succession of image processing and document analysis algorithms is employed for segmenting newspaper page images into various objects (text, images and drawings, titles). A rule based approach is subsequently applied to the objects identified during the page segmentation phase for reconstructing individual articles. Experimental results, obtained from a large testbed of old newspaper issues, are presented which clearly demonstrate the applicability of our integrated approach to successful newspaper page segmentation and identification of newspaper articles.
机译:将报纸页面转换为数字资源是一项重要任务,极大地促进了报纸档案的保存和访问。在本文中,提出了一种用于分割报纸页面和识别报纸文章的集成方法。在第一阶段,采用一系列图像处理和文档分析算法将报纸页面图像分割成各种对象(文本,图像和图形,标题)。随后将基于规则的方法应用于在页面分割阶段期间标识的对象,以重建单个文章。实验结果是从旧报纸的大型测试平台上获得的,这些结果清楚地证明了我们的综合方法在成功进行报纸页面分割和报纸文章识别方面的适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号