首页> 外国专利> System and method for recursively traversing the internet and other sources to identify, gather, curate, adjudicate, and qualify business identity and related data

System and method for recursively traversing the internet and other sources to identify, gather, curate, adjudicate, and qualify business identity and related data

机译:用于递归遍历互联网和其他来源以识别,收集,管理,裁定和限定业务标识和相关数据的系统和方法

摘要

A system and a method used for data discovery in accordance with an inquiry in which multiple sources, which may be web sites or other data sources, are examined for data relevant to the inquiry. The process and method is performed recursively an indeterminate number of iterations, using data and metadata from multiple sources to corroborate discovered data and metadata from other sources, until no further relevant data or sources are found, or adjudication or exception rules have been met. Discovered data and metadata are curated, adjudicated to assess reliability, synthesized, and clustered into composite records using precedence rules and provenance to determine the most reliable data sources as well as terms of use for each source. Data, metadata, and information about each search are retained and can be used for subsequent purposes, such as subsequent searches or other downstream activities.
机译:一种根据查询用于数据发现的系统和方法,其中检查可能是网站或其他数据源的多个源以获取与查询有关的数据。使用来自多个源的数据和元数据来确证来自其他源的发现的数据和元数据,以递归方式执行不确定的迭代次数的过程和方法,直到找不到其他相关数据或源,或者满足裁决或例外规则。使用优先级规则和出处,对发现的数据和元数据进行整理,裁定,以评估可靠性,进行综合并组合到复合记录中,以确定最可靠的数据源以及每个数据源的使用条款。保留有关每个搜索的数据,元数据和信息,并可将其用于后续目的,例如后续搜索或其他下游活动。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号