
AUTOMATIC GENERATION OF STRUCTURED DATA FROM SEMI-STRUCTURED DATA


Abstract

A method and system for generating structured data from semi-structured data are provided. The method includes reading a plurality of records from a data file including semi-structured data. Further, the method includes obtaining aligned delimiters in a list for every record that has been read. The method also includes selecting a most occurring delimiter from the list. The method then includes constructing a regular expression using the selected delimiter to split the records into different fields. The method also includes reconstructing the records for the regular expression to fit and split into fields. In addition, the method includes displaying the records split into the fields.
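The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how delimiters are "aligned" or how records are "reconstructed", so the candidate delimiter set, the simple frequency count, and the whitespace trimming below are all assumptions made for illustration, and every function name is hypothetical.

```python
import re
from collections import Counter

# Assumption: the abstract does not list candidate delimiters; these are illustrative.
CANDIDATE_DELIMITERS = [",", "\t", "|", ";", ":"]


def delimiters_in_record(record):
    """Return the candidate delimiters found in one record, in order of appearance."""
    return [ch for ch in record if ch in CANDIDATE_DELIMITERS]


def select_delimiter(records):
    """Pick the delimiter that occurs most often across all records."""
    counts = Counter()
    for record in records:
        counts.update(delimiters_in_record(record))
    if not counts:
        raise ValueError("no candidate delimiter found in the records")
    return counts.most_common(1)[0][0]


def build_splitter(delimiter):
    """Compile a regular expression matching the delimiter and any spaces around it."""
    return re.compile(r" *" + re.escape(delimiter) + r" *")


def split_records(records):
    """Split every record into fields using the most frequent delimiter."""
    delimiter = select_delimiter(records)
    splitter = build_splitter(delimiter)
    # Trimming each record stands in for the "reconstructing" step (an assumption).
    return delimiter, [splitter.split(record.strip()) for record in records]


if __name__ == "__main__":
    sample = [
        "alice | 30 | NYC",
        "bob | 25 | LA, CA",
        "carol | 41 | Austin",
    ]
    chosen, rows = split_records(sample)
    print("delimiter:", repr(chosen))
    for row in rows:
        print(row)
```

In the sample, the comma appears once and the pipe six times, so the pipe is selected and the comma inside "LA, CA" is left as field content.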

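Bibliographic details

  • Publication number: US2017316070A1

  • Patent type:

  • Publication date: 2017-11-02

  • Original format: PDF

  • Applicant/assignee: UNIFI SOFTWARE

  • Application number: US201715583966

  • Filing date: 2017-05-01

  • Classification: G06F17/30

  • Country: US

  • Date indexed: 2022-08-21 13:49:56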
