Object extraction from presentation-oriented documents using a semantic and spatial approach

Abstract

Automatic extraction of objects in a presentation-oriented document comprises receiving the presentation-oriented document (POD) in which content elements are spatially arranged in a given layout organization for presenting contents to human users; receiving a set of descriptors that semantically define the objects to extract from the POD based on attributes comprising the objects; using the set of descriptors to identify content elements in the POD that match the attributes in the set of descriptors defining the objects, and assigning semantic annotations to the identified elements based on the descriptors; creating a semantic and spatial document model (SSDM) containing spatial structures of the identified content elements in the POD and the semantic annotations assigned to the identified content elements; extracting the identified content elements from the POD based on the set of descriptors and the SSDM to create a set of object instances; and performing at least one of: i) using the object instances to generate semantic and spatial wrappers that can be reused on a different POD, and ii) storing the object instances in a data repository.
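
The steps in the abstract (match descriptors, annotate, build a combined semantic and spatial model, extract object instances) can be illustrated with a small Python sketch. This is not the patented method: the class names, the use of regular expressions as descriptors, and the row-based grouping by vertical position are all assumptions chosen only to make the flow concrete. Wrapper generation and repository storage are left out.

```python
import re
from dataclasses import dataclass


@dataclass
class ContentElement:
    """A piece of text with its position on the page (hypothetical model)."""
    text: str
    x: float
    y: float


@dataclass
class Descriptor:
    """Says which attribute of which object a text pattern stands for.
    Using a regex as the matching rule is an assumption of this sketch."""
    object_name: str
    attribute: str
    pattern: str


@dataclass
class AnnotatedElement:
    element: ContentElement
    object_name: str
    attribute: str


def annotate(elements, descriptors):
    """Assign a semantic annotation to every element matched by a descriptor."""
    annotated = []
    for el in elements:
        for d in descriptors:
            if re.fullmatch(d.pattern, el.text):
                annotated.append(AnnotatedElement(el, d.object_name, d.attribute))
                break  # first matching descriptor wins
    return annotated


def build_ssdm(annotated, row_tolerance=5.0):
    """Toy stand-in for an SSDM: annotated elements grouped into visual rows.
    Elements whose y differs from the row's first element by at most
    row_tolerance share a row; each row is ordered left to right."""
    rows = []
    for a in sorted(annotated, key=lambda a: (a.element.y, a.element.x)):
        if rows and abs(rows[-1][0].element.y - a.element.y) <= row_tolerance:
            rows[-1].append(a)
        else:
            rows.append([a])
    for row in rows:
        row.sort(key=lambda a: a.element.x)
    return rows


def extract_objects(ssdm, descriptors):
    """Emit one object instance per row that contains every attribute
    the descriptors define for that object."""
    required = {}
    for d in descriptors:
        required.setdefault(d.object_name, set()).add(d.attribute)
    instances = []
    for row in ssdm:
        for name, attrs in required.items():
            found = {a.attribute: a.element.text
                     for a in row if a.object_name == name}
            if attrs.issubset(found):
                instances.append({"object": name, **found})
    return instances


# Hypothetical input: a title line followed by two name/price rows.
elements = [
    ContentElement("Catalog", 10, 0),
    ContentElement("Laptop", 10, 20),
    ContentElement("$999.00", 200, 21),
    ContentElement("Phone", 10, 40),
    ContentElement("$499.50", 200, 40),
]
descriptors = [
    Descriptor("product", "name", r"[A-Z][a-z]+"),
    Descriptor("product", "price", r"\$\d+\.\d{2}"),
]
instances = extract_objects(build_ssdm(annotate(elements, descriptors)), descriptors)
```

In this sketch the title "Catalog" matches the name pattern but has no price in its row, so the spatial grouping keeps it from becoming an object instance; only the two complete name/price rows are extracted.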

Bibliographic details

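  • Publication number: US9582494B2
  • Publication date: 2017-02-28
  • Original format: PDF
  • Applicant/assignee: ALTILIA S.R.L.
  • Application number: US201313774289
  • Inventors: ERMELINDA ORO; MASSIMO RUFFOLO
  • Filing date: 2013-02-22
  • Classification: G06F17; G06F17/27
  • Country: US
  • Date added to database: 2022-08-21 13:42:24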
