首页> 外国专利> Dimension optimization in singular value decomposition-based topic models

Dimension optimization in singular value decomposition-based topic models

机译:基于奇异值分解的主题模型中的维度优化

摘要

Techniques are described for analyzing text. Embodiments tokenize a plurality of documents into a plurality of sets of terms. An average top dimension weight corresponding to the plurality of documents is calculated based on performing singular value decomposition (SVD) factorization for a plurality of dimension counts. An average inverse top dimension top term ranking for the plurality of documents is further calculated based on the SVD factorization for the plurality of dimension counts. A number of dimensions is determined based on the average top dimension weight and the average inverse top dimension top term ranking. A topic model is built for the plurality of documents based on the number of dimensions. The topic model is adapted to identify patterns of terms that correspond to semantic topics in at least the plurality of documents.
机译:描述了用于分析文本的技术。 实施例将多个文档赋成多组术语。 基于对多个尺寸计数的执行奇异值分解(SVD)分解来计算与多个文档对应的平均顶部尺寸重量。 基于多个维数计数的SVD分解,进一步计算多个文档的平均逆顶尺寸顶部术语排名。 基于平均顶部尺寸重量和平均反向顶部尺寸顶部排名确定了许多尺寸。 基于维度的数量为多个文档构建主题模型。 主题模型适于识别与至少多个文档中的语义主题对应的术语模式。

著录项

  • 公开/公告号US11120051B2

    专利类型

  • 公开/公告日2021-09-14

    原文格式PDF

  • 申请/专利权人 THE BOEING COMPANY;

    申请/专利号US201916524983

  • 申请日2019-07-29

  • 分类号G06F16/93;G06F16/28;G06F16/2458;G06F16/36;

  • 国家 US

  • 入库时间 2022-08-24 21:01:25

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号