首页> 外国专利> DATA EXTRACTION ENGINE FOR STRUCTURED, SEMI-STRUCTURED AND UNSTRUCTURED DATA WITH AUTOMATED LABELING AND CLASSIFICATION OF DATA PATTERNS OR DATA ELEMENTS THEREIN, AND CORRESPONDING METHOD THEREOF

DATA EXTRACTION ENGINE FOR STRUCTURED, SEMI-STRUCTURED AND UNSTRUCTURED DATA WITH AUTOMATED LABELING AND CLASSIFICATION OF DATA PATTERNS OR DATA ELEMENTS THEREIN, AND CORRESPONDING METHOD THEREOF

机译:具有数据标签或数据元素的自动标记和分类的结构化,半结构化和非结构化数据的数据提取引擎及其相应方法

摘要

A fully or semi-automated, integrated learning, labeling and classification system and method have closed, self-sustaining pattern recognition, labeling and classification operation, wherein unclassified data sets are selected and converted to an assembly of graphic and text data forming compound data sets that are to be classified. By means of feature vectors, which can be automatically generated, a machine learning classifier is trained for improving the classification operation of the automated system during training as a measure of the classification performance if the automated labeling and classification system is applied to unlabeled and unclassified data sets, and wherein unclassified data sets are classified automatically by applying the machine learning classifier of the system to the compound data set of the unclassified data sets.
机译:完全或半自动化的集成学习,标记和分类系统和方法具有封闭的,自我维持的模式识别,标记和分类操作,其中选择未分类的数据集并将其转换为图形和文本数据的组合,从而形成复合数据集被分类。通过自动生成的特征向量,可以训练机器学习分类器,以在训练过程中改进自动系统的分类操作,以作为对分类性能的度量(如果将自动标记和分类系统应用于未标记和未分类的数据)集合,其中通过将系统的机器学习分类器应用于未分类数据集的复合数据集来自动分类未分类数据集。

著录项

  • 公开/公告号EP3533004A1

    专利类型

  • 公开/公告日2019-09-04

    原文格式PDF

  • 申请/专利权人 SWISS REINSURANCE COMPANY LTD.;

    申请/专利号EP20160790919

  • 发明设计人 MÜLLER FELIX;

    申请日2016-10-26

  • 分类号G06N99;

  • 国家 EP

  • 入库时间 2022-08-21 12:26:56

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号