首页> 外国专利> Building classification and extraction models based on electronic forms

Building classification and extraction models based on electronic forms

机译:基于电子表格的建筑物分类和提取模型

摘要

According to one embodiment, a computer-implemented method is configured for building a classification and/or data extraction knowledge base using an electronic form. The method includes: receiving an electronic form having associated therewith a plurality of metadata labels, each metadata label corresponding to at least one element of interest represented within the electronic form; parsing the plurality of metadata labels to determine characteristic features of the element(s) of interest; building a representation of the electronic form based on the plurality of metadata labels; generating a plurality of permutations of the representation of the electronic form by applying a predetermined set of variations to the representation; and training either a classification model, an extraction model, or both using: the representation of the electronic form, and the plurality of permutations of the representation of the electronic form. Corresponding systems and computer program products are also disclosed.
机译:根据一个实施例,一种计算机实现的方法被配置用于使用电子表格来建立分类和/或数据提取知识库。该方法包括:接收具有与其关联的多个元数据标签的电子表格,每个元数据标签对应于在电子表格内表示的至少一个感兴趣的元素;以及解析多个元数据标签以确定感兴趣的元素的特征;基于多个元数据标签建立电子表格的表示;通过将预定的一组变体应用于表示来生成电子表格的表示的多个排列;以及使用以下方式训练分类模型,提取模型或同时使用这两种方法:电子表单的表示以及电子表单的表示的多个排列。还公开了相应的系统和计算机程序产品。

著录项

  • 公开/公告号US10140511B2

    专利类型

  • 公开/公告日2018-11-27

    原文格式PDF

  • 申请/专利权人 KOFAX INC.;

    申请/专利号US201615396322

  • 申请日2016-12-30

  • 分类号G06K9;G06F17/30;G06K9/20;

  • 国家 US

  • 入库时间 2022-08-21 12:09:04

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号