首页> 外国专利> METHOD AND APPARATUS OF PROCESSING SEMISTRUCTURED TEXTUAL DATA INTO PREDETERMINED DATA STRUCTURES DEFINED BY A STRUCTURE DEFINITION

METHOD AND APPARATUS OF PROCESSING SEMISTRUCTURED TEXTUAL DATA INTO PREDETERMINED DATA STRUCTURES DEFINED BY A STRUCTURE DEFINITION

机译:将半结构化文本数据处理为由结构定义确定的预定数据结构的方法和装置

摘要

A method of processing semistructured data, in particular semistructured textual data, to output data which is in accordance with a predetermined structure, wherein said semistructured data is structured into one or more elements according to a given syntax, the actual content of the syntax elements being variable and being called a token, said method comprising: extracting by means of an extractor (parser) from said semistructured data one or more tokens, said parser being capable of returning at least one token in response to a respective specific command identifying the requested token by a token identifier, wherein said method further comprises: providing a sequence of commands and an associated data structure definition, both together being called a loader, said loader comprising the commands necessary to cause said parser to return the one or more tokens to be extracted; causing by said sequence of commands of said loader said parser to extract said one or more tokens from said semistructured data and further converting said extracted tokens into said predetermined data structure defined by said associated structure definition.
机译:一种处理半结构化数据,特别是半结构化文本数据,以输出符合预定结构的数据的方法,其中,根据给定的语法将所述半结构化数据构造为一个或多个元素,语法元素的实际内容为变量并称为令牌,所述方法包括:借助于提取器(解析器)从所述半结构化数据中提取一个或多个令牌,所述解析器能够响应于标识所请求令牌的相应特定命令而返回至少一个令牌。通过令牌标识符,其中所述方法还包括:提供一系列命令和相关联的数据结构定义,两者均称为加载程序,所述加载程序包括使所述解析器返回要提取的一个或多个令牌所必需的命令;通过所述加载器的所述命令序列使所述解析器从所述半结构化数据中提取所述一个或多个令牌,并将所述提取的令牌进一步转换为由所述相关联的结构定义所定义的所述预定数据结构。

著录项

  • 公开/公告号US2003055849A1

    专利类型

  • 公开/公告日2003-03-20

    原文格式PDF

  • 申请/专利权人 THURE ETZOLD;THIERRY COUPAYE;

    申请/专利号US19990475255

  • 发明设计人 ETZOLD THURE;COUPAYE THIERRY;

    申请日1999-12-30

  • 分类号G06F17/00;

  • 国家 US

  • 入库时间 2022-08-22 00:11:21

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号