Conference: Document Recognition III

Genetic approach to the analysis of complex text formatting

Abstract

Traditional document analysis systems often adopt a top-down framework, i.e., they are composed of various locally interacting functional components, guided by a central control mechanism. The design of each component is determined by a human expert and is optimized for a given class of inputs. Such a system can fail when confronted by an input that falls outside its anticipated domain. This paper investigates the use of a genetic-based adaptive mechanism in the analysis of complex text formatting. Specifically, we explore a genetic approach to the binarization problem. As opposed to a single, pre-defined, 'optimal' thresholding scheme, the genetic-based process applies various known methods and evaluates their effectiveness on the input image. Individual regions are treated independently, while the genetic algorithm attempts to optimize the overall result for the entire page. Advantages and disadvantages of this approach are discussed.
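
The abstract does not say which thresholding methods, chromosome encoding, or fitness measure the paper uses, so the following is only a minimal sketch of the general idea under stated assumptions. Each region of the page is a flat list of gray levels, each gene selects one of three candidate thresholding methods for one region, and a page-level fitness sums a per-region score. The candidate methods, the between-class-variance score, and all names (`METHODS`, `separation`, `fitness`, `evolve`) are hypothetical stand-ins for illustration, not the authors' design.

```python
import random


def separation(pixels, t):
    """Between-class variance of the split (pixels <= t) vs (pixels > t)."""
    dark = [p for p in pixels if p <= t]
    light = [p for p in pixels if p > t]
    if not dark or not light:
        return 0.0  # a threshold that leaves one class empty separates nothing
    w0, w1 = len(dark) / len(pixels), len(light) / len(pixels)
    m0, m1 = sum(dark) / len(dark), sum(light) / len(light)
    return w0 * w1 * (m0 - m1) ** 2


def mean_threshold(pixels):
    return sum(pixels) / len(pixels)


def midrange_threshold(pixels):
    return (min(pixels) + max(pixels)) / 2


def otsu_threshold(pixels):
    """Exhaustive Otsu-style search over the gray levels present in the region."""
    best_t, best_score = min(pixels), -1.0
    for t in sorted(set(pixels)):
        score = separation(pixels, t)
        if score > best_score:
            best_t, best_score = t, score
    return best_t


# Assumed pool of "known methods"; the paper's actual pool is not given.
METHODS = [mean_threshold, midrange_threshold, otsu_threshold]


def fitness(chromosome, regions):
    """Assumed page-level score: sum of per-region separation scores."""
    return sum(separation(r, METHODS[g](r)) for g, r in zip(chromosome, regions))


def evolve(regions, pop_size=20, generations=30, mutation_rate=0.1, seed=0):
    """Search for one method index per region with a simple genetic algorithm."""
    rng = random.Random(seed)
    n = len(regions)

    def score(c):
        return fitness(c, regions)

    pop = [[rng.randrange(len(METHODS)) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        next_pop = [max(pop, key=score)[:]]  # elitism: keep the current best
        while len(next_pop) < pop_size:
            # Tournament selection of two parents (tournament size 3).
            a = max(rng.sample(pop, 3), key=score)
            b = max(rng.sample(pop, 3), key=score)
            cut = rng.randrange(1, n) if n > 1 else 0  # one-point crossover
            child = a[:cut] + b[cut:]
            # Mutation: occasionally reassign a region to a random method.
            child = [rng.randrange(len(METHODS)) if rng.random() < mutation_rate else g
                     for g in child]
            next_pop.append(child)
        pop = next_pop
    return max(pop, key=score)
```

Calling `evolve(regions)` returns a list with one index into `METHODS` per region. Because this stand-in fitness is a plain sum over regions, a per-region exhaustive choice would find the same optimum; a genetic search becomes worthwhile only when the page-level score couples regions, for example through the output of a later recognition stage, which the abstract suggests but does not specify.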
