
PHENOMENOLOGICAL SEMANTIC DISTANCE FROM LATENT DIRICHLET ALLOCATIONS (LDA) CLASSIFICATION


Abstract

Embodiments provide a system and method for calculating semantic distance. The method can involve: receiving a plurality of documents having a set of subjects extracted using latent Dirichlet allocation (LDA); for each document in the plurality of documents, generating a classification list comprising a ranking of one or more of the subjects based on the relevance of each subject to the document; for each classification list, calculating the semantic distance between each pair of subjects present on the classification list; aggregating the plurality of classification lists; and creating a distance matrix containing the relative semantic distances between the members of the set of subjects.
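The steps in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the distance between two subjects on a classification list is computed, nor how the lists are aggregated, so both are assumptions here: the per-list distance is taken to be the gap in rank position, and aggregation is the mean of that gap over the lists on which both subjects appear. The function names (`classification_list`, `list_distances`, `distance_matrix`) and the `min_weight` parameter are hypothetical, and the per-document subject weights are assumed to come from an LDA model run beforehand.

```python
from collections import defaultdict
from itertools import combinations


def classification_list(topic_weights, min_weight=0.0):
    """Rank one document's subjects by relevance, highest weight first."""
    ranked = sorted(topic_weights.items(), key=lambda kv: (-kv[1], kv[0]))
    return [subject for subject, weight in ranked if weight > min_weight]


def list_distances(ranked_subjects):
    """Assumed per-list distance: gap in rank position between two subjects."""
    position = {s: i for i, s in enumerate(ranked_subjects)}
    return {
        tuple(sorted((a, b))): abs(position[a] - position[b])
        for a, b in combinations(ranked_subjects, 2)
    }


def distance_matrix(documents, min_weight=0.0):
    """Aggregate per-list distances into a symmetric subject-by-subject matrix.

    Assumed aggregation: mean rank gap over the lists where both subjects
    occur. Pairs that never share a list are left as None.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    subjects = set()
    for topic_weights in documents:
        ranked = classification_list(topic_weights, min_weight)
        subjects.update(ranked)
        for pair, gap in list_distances(ranked).items():
            totals[pair] += gap
            counts[pair] += 1
    order = sorted(subjects)
    matrix = {a: {b: (0.0 if a == b else None) for b in order} for a in order}
    for (a, b), total in totals.items():
        mean_gap = total / counts[(a, b)]
        matrix[a][b] = mean_gap
        matrix[b][a] = mean_gap
    return order, matrix


# Illustrative subject weights for three documents (made-up values).
docs = [
    {"finance": 0.6, "law": 0.3, "sports": 0.1},
    {"law": 0.7, "finance": 0.3},
    {"sports": 0.8, "finance": 0.2},
]
order, matrix = distance_matrix(docs)
print(order)                        # ['finance', 'law', 'sports']
print(matrix["finance"]["law"])     # 1.0
print(matrix["finance"]["sports"])  # 1.5
```

Other choices, such as weighting the rank gap by the subject probabilities or using a different aggregate than the mean, would fit the abstract equally well; the sketch only fixes one concrete option so the data flow from ranked lists to the distance matrix is visible.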
