Extracting and denoising concept mentions using distributed representations of concepts

Abstract

A method and apparatus are provided for automatically analyzing candidate concepts extracted from a first source text against a reference concept set comprising a plurality of concepts. A vector representation is obtained for each concept in the first concept set (the extracted candidate concepts) and in the reference concept set. A natural language processing (NLP) analysis then compares the candidate concepts to the reference concept set to determine a similarity measure for each candidate concept. One or more of the candidate concepts are validated when their similarity measure meets a minimum similarity threshold requirement.
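
The validation step described in the abstract can be illustrated with a minimal sketch. The abstract does not name a similarity measure or say how a candidate is scored against the whole reference set, so this example assumes cosine similarity and takes the maximum over all reference concepts. The function names, the toy 3-dimensional vectors, and the threshold of 0.7 are all hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def validate_candidates(candidates, reference, threshold):
    """Return {candidate: score} for candidates whose best similarity
    to any reference concept meets the minimum threshold.

    Assumption: "similarity measure" = max cosine similarity over the
    reference set. The abstract does not specify this choice.
    """
    validated = {}
    for name, vec in candidates.items():
        score = max(
            (cosine(vec, ref_vec) for ref_vec in reference.values()),
            default=0.0,  # empty reference set: nothing can be validated
        )
        if score >= threshold:
            validated[name] = score
    return validated


# Toy vectors invented for illustration; real vectors would come from a
# trained distributed representation of concepts.
reference = {
    "myocardial infarction": [0.9, 0.1, 0.0],
    "hypertension": [0.1, 0.9, 0.0],
}
candidates = {
    "heart attack": [0.8, 0.2, 0.1],   # close to a reference concept
    "parking lot": [0.0, 0.1, 0.9],    # likely a noisy extraction
}

validated = validate_candidates(candidates, reference, threshold=0.7)
print(sorted(validated))
```

In this toy setup, "heart attack" is kept because its vector lies close to a reference concept, while "parking lot" falls below the threshold and is dropped as noise. Raising the threshold trades recall for precision in the retained candidates.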
