首页> 外国专利> Method and program for efficiently structuring and correcting open data

Method and program for efficiently structuring and correcting open data

机译:用于有效地构造和纠正开放数据的方法和程序

摘要

Problem to be solved: to provide an information processing method and a program which automatically collects data which can be accessed on a network and extracts information and automatically constructs it by processing such as machine learning, and corrects structured data.Information processing apparatusAccept network address as inputA data acquisition section 20 for acquiring a markup language or the like constituting a web page at an address as text data andGrasp the network address derived from the acquired text dataDirectory structure extractor 21 to construct the directory structureTo make it easier to process text dataData cleansing section 22 to perform formatting such as excluding extra charactersFrom text data formatted by data cleansing sectionExtract the necessary informationA data structure section 23 structured by applying tags to each extracted information and.Diagram
机译:要解决的问题:提供信息处理方法和程序,该程序自动收集可以在网络上访问的数据并通过机器学习(如机器学习)自动构建信息,并纠正结构化数据。信息处理设备公开网络地址作为输入数据获取部分20,用于在地址上获取构成网页的标记语言等作为文本数据和从获取的文本数据向导结构提取器21派生的网络地址构建目录STRACHURETO使其更容易要处理文本DataData清除部分22以执行格式,例如通过数据清理部分格式化的文本数据排除额外的字符,通过将标签应用于每个提取的信息和adiagram来构造的必要的InformationA数据结构部分23

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号