首页> 外国专利> Example-based concept-oriented data extraction method

Example-based concept-oriented data extraction method

机译:基于实例的面向概念的数据提取方法

摘要

The present invention relates to an example-based concept-orietned data extraction method. In an example labeling phase, the exemplary data string is converted into an exemplary token sequence, in which the target concepts and filler concepts are labeled to be tuples for use as an example, and thus an exemplary concept graph is constructed. In the data extraction phase, the untested data string is converted into an untested token sequence to be processed, and, based on the associated concept recognizers defined by the tuples in the example labeling phase, it is able to detect the concept candidates and establish the composite concepts and aggregate concepts, thereby constructing a hypothetical concept graph. After comparing the exemplary concept graph with the hypothetical concept graph, the optimal hypothetical concept sequence in the hypothetical graph is determined, so as to extract the targeted data from the matched target concepts.
机译:本发明涉及基于实例的概念源数据提取方法。在示例标记阶段,将示例数据串转换为示例标记序列,其中将目标概念和填充概念标记为元组以用作示例,从而构造示例概念图。在数据提取阶段,将未经测试的数据字符串转换为未经测试的令牌序列以进行处理,并且基于在示例标记阶段由元组定义的相关概念识别器,它能够检测到概念候选并建立组合概念和聚合概念,从而构建假设的概念图。在将示例性概念图与假设概念图进行比较之后,确定假设图中的最优假设概念序列,以从匹配的目标概念中提取目标数据。

著录项

  • 公开/公告号US2004123237A1

    专利类型

  • 公开/公告日2004-06-24

    原文格式PDF

  • 申请/专利权人 IND TECH RES INST;

    申请/专利号US20030442300

  • 发明设计人 CHUNG-JEN CHIU;YI-CHUNG LIN;

    申请日2003-05-21

  • 分类号G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 23:20:17

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号