首页> 外国专利> Disambiguation method of features in unstructured text

Disambiguation method of features in unstructured text

机译:非结构化文本中特征的消歧方法

摘要

A method for disambiguating features in unstructured text is provided. The disclosed method may not require pre-existing links to be present. The method for disambiguating features in unstructured text may use co-occurring features derived from both the source document and a large document corpus. The disclosed method may include multiple modules, including a linking module for linking the derived features from the source document to the co-occurring features of an existing knowledge base. The disclosed method for disambiguating features may allow identifying unique entities from a knowledge base that includes entities with a unique set of co-occurring features, which in turn may allow for increased precision in knowledge discovery and search results, employing advanced analytical methods over a massive corpus, employing a combination of entities, co-occurring entities, topic IDs, and other derived features.
机译:提供了一种用于消除非结构化文本中的特征的歧义的方法。所公开的方法可能不需要存在预先存在的链接。用于对非结构化文本中的特征进行歧义消除的方法可以使用从源文档和大型文档语料库中导出的同现特征。所公开的方法可以包括多个模块,包括用于将从源文档导出的特征链接到现有知识库的同时出现的特征的链接模块。所公开的用于消除歧义的方法可以允许从包括具有共同出现的特征的唯一集合的实体的知识库中识别唯一的实体,这继而可以允许在大规模使用高级分析方法的情况下提高知识发现和搜索结果的精度。语料库,采用实体,共同出现的实体,主题ID和其他派生特征的组合。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号