首页> 外国专利> IDENTITY RESOLUTION IN BIG, NOISY, AND/OR UNSTRUCTURED DATA

IDENTITY RESOLUTION IN BIG, NOISY, AND/OR UNSTRUCTURED DATA

机译:大数据,嘈杂数据和/或非结构化数据中的身份解析

摘要

In an environment containing big data, noisy data, and/or unstructured data, it is desirable to identify an entity referenced by input data. The entity can be identified by generating records corresponding to characteristics of the entity based on the input data. These records can be merged when it is determined that more than one record corresponds to the same entity. By doing so it is possible to more easily identify and classify information related to an entity, though such information may have been obtained in a manner that might otherwise be deemed unstructured or noisy. The method can be applied across large sets of data (“big data”) to obtain meaning from data that may otherwise be unclassifiable to a human observer.
机译:在包含大数据,嘈杂数据和/或非结构化数据的环境中,希望识别输入数据引用的实体。可以通过基于输入数据生成与实体的特征相对应的记录来识别实体。当确定多个记录对应于同一实体时,可以合并这些记录。通过这样做,有可能更容易地识别和分类与实体有关的信息,尽管这种信息可能已经以其他方式被认为是非结构化或嘈杂的方式获得。该方法可以应用于大数据集(“大数据”),以从原本无法归类为人类观察者的数据中获取含义。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号