
A framework for forms processing using an enhanced-line-shared-adjacent format



Abstract

The objective of this paper is to introduce a novel framework for forms processing that seamlessly handles two different kinds of formats: physical formats, whose fields have rigidly defined positions and sizes, and topological formats, in which variations in the positions and sizes of fields are acceptable as long as the topological relations between pairs of fields are preserved. A line-shared adjacent (LSA) cell relation and an LSA format are introduced to define topological formats, and are then enhanced to describe physical information, giving the enhanced LSA (e-LSA) format and an enhanced line-oriented (e-LO) format. The e-LSA format is flexible enough to define not only physical and topological formats but also hybrid formats, on which our framework is based. It has characteristics of both physical and topological formats, and enables the framework to handle the two kinds of information seamlessly. The framework consists of four modules: a format generator, a format converter, a format class manager, and a form processor, which performs all the processes for field detection, on which our research focuses. The way in which these modules collaborate is illustrated with some examples, which confirm the effectiveness of our framework.
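The distinction between physical and topological formats can be made concrete with a small sketch. The abstract does not give the paper's formal definition of the LSA relation, so the code below rests on an assumption: two rectangular cells are treated as line-shared adjacent when they share a border segment of positive length. The names `Cell`, `lsa_relations`, `matches_physical`, and `matches_topological` are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Cell:
    """Axis-aligned form field (hypothetical representation)."""
    name: str
    left: int
    top: int
    right: int
    bottom: int


def shared_line(a, b):
    """Describe how cell a sits relative to b if they share a border segment.

    Assumed reading of "line-shared adjacent": the two cells touch along
    one edge and overlap along that edge by a positive length.
    """
    y_overlap = min(a.bottom, b.bottom) - max(a.top, b.top)
    x_overlap = min(a.right, b.right) - max(a.left, b.left)
    if y_overlap > 0:
        if a.right == b.left:
            return "left_of"
        if b.right == a.left:
            return "right_of"
    if x_overlap > 0:
        if a.bottom == b.top:
            return "above"
        if b.bottom == a.top:
            return "below"
    return None


def lsa_relations(cells):
    """Collect the adjacency relations between all pairs of cells."""
    relations = set()
    for a, b in combinations(sorted(cells, key=lambda c: c.name), 2):
        kind = shared_line(a, b)
        if kind is not None:
            relations.add((a.name, kind, b.name))
    return relations


def matches_physical(template, form, tol=0):
    """Physical match: every field lies at its template position, within tol."""
    by_name = {c.name: c for c in form}
    if set(by_name) != {c.name for c in template}:
        return False
    edges = ("left", "top", "right", "bottom")
    return all(
        abs(getattr(c, e) - getattr(by_name[c.name], e)) <= tol
        for c in template
        for e in edges
    )


def matches_topological(template, form):
    """Topological match: the pairwise adjacency relations are preserved."""
    return lsa_relations(template) == lsa_relations(form)


# A template and a filled form whose fields were resized but kept their layout.
template = [
    Cell("name", 0, 0, 100, 20),
    Cell("date", 100, 0, 160, 20),
    Cell("address", 0, 20, 160, 60),
]
resized = [
    Cell("name", 0, 0, 120, 25),
    Cell("date", 120, 0, 160, 25),
    Cell("address", 0, 25, 160, 70),
]

print(matches_physical(template, resized))     # positions differ
print(matches_topological(template, resized))  # adjacency is preserved
```

In this sketch the resized form fails the physical check but passes the topological one, which is the kind of variation a topological format is meant to accept. Real scanned forms would need a tolerance on the shared-edge test as well; exact integer equality is used here only to keep the example short.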
