
Page grammars and page parsing. A syntactic approach to document layout recognition


Abstract

Describes a syntactic approach to deducing the logical structure of printed documents from their physical layout. Page layout is described by a two-dimensional grammar, similar to a context-free string grammar, and a chart parser is used to parse segmented page images according to the grammar. This process is part of a system which reads scanned document images and produces computer-readable text in a logical mark-up format such as SGML. The system is briefly outlined, the grammar formalism and the parsing algorithm are described in detail, and some experimental results are reported.
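
To make the idea of parsing a segmented page with a two-dimensional grammar concrete, here is a minimal sketch in Python. It is not the paper's formalism or algorithm: the rule format, the block labels (`title`, `author`, `column`, and so on), the `gap` tolerance, and all function names are assumptions made for illustration. Each hypothetical rule combines two labelled rectangles that are adjacent vertically ("V") or horizontally ("H"), and a simple agenda-driven, bottom-up chart loop adds every constituent it can derive.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """A labelled rectangle on the page (hypothetical representation)."""
    label: str
    x0: int
    y0: int
    x1: int
    y1: int


# Hypothetical binary rules: (parent, first child, second child, direction).
# "V": first child lies directly above the second.
# "H": first child lies directly to the left of the second.
RULES = [
    ("header", "title", "author", "V"),
    ("body", "column", "column", "H"),
    ("page", "header", "body", "V"),
]


def adjacent(a, b, direction, gap=20):
    """True if b follows a in the given direction within `gap` units."""
    if direction == "V":
        overlap = min(a.x1, b.x1) - max(a.x0, b.x0)
        return overlap > 0 and 0 <= b.y0 - a.y1 <= gap
    overlap = min(a.y1, b.y1) - max(a.y0, b.y0)
    return overlap > 0 and 0 <= b.x0 - a.x1 <= gap


def combine(parent, a, b):
    """Bounding box of a and b, labelled with the parent nonterminal."""
    return Box(parent, min(a.x0, b.x0), min(a.y0, b.y0),
               max(a.x1, b.x1), max(a.y1, b.y1))


def parse(blocks):
    """Bottom-up chart loop: derive every constituent the rules allow."""
    chart = set(blocks)      # all constituents found so far
    agenda = list(blocks)    # constituents not yet combined with the chart
    while agenda:
        item = agenda.pop()
        for other in list(chart):
            for parent, first, second, direction in RULES:
                # Try the pair in both orders, since either may come first.
                for a, b in ((item, other), (other, item)):
                    if (a.label == first and b.label == second
                            and adjacent(a, b, direction)):
                        new = combine(parent, a, b)
                        if new not in chart:
                            chart.add(new)
                            agenda.append(new)
    return chart


# Made-up segmentation output for a two-column page.
blocks = [
    Box("title", 0, 0, 100, 10),
    Box("author", 0, 12, 100, 20),
    Box("column", 0, 30, 48, 100),
    Box("column", 52, 30, 100, 100),
]
chart = parse(blocks)
print(any(b.label == "page" for b in chart))  # True
```

A successful parse is signalled by a `page` constituent covering the whole layout; a fuller system would also record which children produced each constituent so that the logical structure could be written out as mark-up.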
