首页> 外文会议>International Workshop on Graphics Recognition >Extraction of Index Components Based on Contents Analysis of Journal’s Scanned Cover Page
【24h】

Extraction of Index Components Based on Contents Analysis of Journal’s Scanned Cover Page

机译:基于日志扫描封面的内容分析提取索引组分

获取原文

摘要

In this paper, a method for automatically indexing the contents to reduce the effort that used to be required for input paper information and constructing index is sought. Various contents formats for journals, which have different features from those for general documents, are described. The principal elements that we want to represent are titles, authors, and pages for each paper. Thus, the three principal elements are modeled according to the order of their arrangement, and then their features are generalized. The content analysis system is then implemented based on the suggested modeling method. The content analysis system, implemented for verifying the suggested method, gets its input in the form containing more than 300 dpi gray scale image and analyze structural features of the contents. It classifies titles, authors and pages using efficient projection method. The definition of each item is classified according to regions, and then is extracted automatically as index information. It also helps to recognize characters region by region. The experimental result is obtained by applying to some of the suggested 6 models, and the system shows 97.3% success rate for various journals.
机译:在本文中,寻求自动索引内容以减少输入纸信息和构建指数所需的努力的方法。描述了具有常规文档的不同特征的过程的各种内容格式。我们想要代表的主要元素是每份纸张的标题,作者和页面。因此,三个主要元素根据其排列的顺序进行建模,然后它们的特征是概括的。然后基于建议的建模方法实现内容分析系统。实现用于验证建议的方法的内容分析系统,以包含超过300 dpi灰度图像的形式输入其输入,并分析内容的结构特征。它使用有效的投影方法对其进行分类,作者和页面。每个项目的定义根据区域分类,然后将自动提取为索引信息。它还有助于按区域识别角色区域。通过申请一些建议的6种模型来获得实验结果,系统显示各种期刊的97.3%的成功率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号