
GRAMMATICAL PARSING OF DOCUMENT VISUAL STRUCTURES


Abstract

A two-dimensional representation of a document is used to extract a hierarchical structure that facilitates recognition of the document. The visual structure is grammatically parsed using two-dimensional adaptations of statistical parsing algorithms. This allows layout structures (e.g., columns, authors, titles, footnotes) to be recognized, so that the structural components of the document can be accurately interpreted. Additional techniques can also be employed to facilitate document layout recognition, for example grammatical parsing techniques that utilize machine learning, parse scoring based on image representations, boosting techniques, and/or fast features.
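The abstract does not say which grammar or parsing algorithm is used, so the following is only a minimal sketch of the general idea, not the patented method. It assumes a CKY-style chart parser in which each binary rule carries a geometric test on bounding boxes in addition to a log-probability. The grammar (`Page`, `Body`, `Column`, `Line`, `Title`), the rule probabilities, the helper names, and the assumption that terminals arrive in reading order are all hypothetical choices made for illustration.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box; y grows downward, as in image coordinates."""
    x0: float
    y0: float
    x1: float
    y1: float


def union(a, b):
    """Smallest box covering both a and b."""
    return Box(min(a.x0, b.x0), min(a.y0, b.y0), max(a.x1, b.x1), max(a.y1, b.y1))


def stacked(a, b):
    """b starts at or below the bottom of a, and they overlap horizontally."""
    return b.y0 >= a.y1 and min(a.x1, b.x1) > max(a.x0, b.x0)


def side_by_side(a, b):
    """b starts at or right of a's right edge, and they overlap vertically."""
    return b.x0 >= a.x1 and min(a.y1, b.y1) > max(a.y0, b.y0)


# Hypothetical toy grammar: (parent, left child, right child, geometric test, log-prob).
RULES = [
    ("Column", "Line", "Line", stacked, math.log(0.5)),
    ("Column", "Column", "Line", stacked, math.log(0.5)),
    ("Body", "Column", "Column", side_by_side, math.log(1.0)),
    ("Page", "Title", "Body", stacked, math.log(1.0)),
]


def parse(terminals):
    """CKY over terminals given in an assumed reading order.

    terminals: list of (label, Box).
    Returns (log_score, box, tree) for the best "Page" parse, or None.
    """
    n = len(terminals)
    chart = {}
    for i, (label, box) in enumerate(terminals):
        chart[i, i + 1] = {label: (0.0, box, label)}
    for width in range(2, n + 1):
        for i in range(n - width + 1):
            j = i + width
            cell = {}
            for k in range(i + 1, j):
                for parent, left_sym, right_sym, test, logp in RULES:
                    left = chart[i, k].get(left_sym)
                    right = chart[k, j].get(right_sym)
                    if left is None or right is None:
                        continue
                    # A rule fires only if the two regions are laid out as it expects.
                    if not test(left[1], right[1]):
                        continue
                    score = left[0] + right[0] + logp
                    if parent not in cell or score > cell[parent][0]:
                        tree = (parent, left[2], right[2])
                        cell[parent] = (score, union(left[1], right[1]), tree)
            chart[i, j] = cell
    return chart[0, n].get("Page")


# Example page: a full-width title above two columns of two lines each.
terminals = [
    ("Title", Box(0, 0, 100, 10)),
    ("Line", Box(0, 20, 45, 30)),
    ("Line", Box(0, 30, 45, 40)),
    ("Line", Box(55, 20, 100, 30)),
    ("Line", Box(55, 30, 100, 40)),
]
result = parse(terminals)
```

Here `result` holds the score, bounding box, and parse tree of the single valid `Page` analysis. A system along the lines the abstract describes would presumably learn the rule scores rather than fix them by hand, and would not depend on a given reading order; both are simplifications made to keep the sketch short.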

Bibliographic data

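  • Publication number: EP1894144A4
  • Patent type:
  • Publication date: 2012-12-26
  • Original format: PDF
  • Applicant/patentee: MICROSOFT CORPORATION
  • Application number: EP20060786329
  • Inventors: VIOLA PAUL A.; SHILMAN MICHAEL
  • Filing date: 2006-06-30
  • Classification: G06K9/72; G06F40
  • Country: EP
  • Database entry time: 2022-08-21 16:35:52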
