首页> 外国专利> DIMENSION OPTIMIZATION IN SINGULAR VALUE DECOMPOSITION-BASED TOPIC MODELS

DIMENSION OPTIMIZATION IN SINGULAR VALUE DECOMPOSITION-BASED TOPIC MODELS

机译:基于奇异值分解的主题模型中的维数优化

摘要

Techniques are described for analyzing text. Embodiments tokenize a plurality of documents into a plurality of sets of terms. An average top dimension weight corresponding to the plurality of documents is calculated based on performing singular value decomposition (SVD) factorization for a plurality of dimension counts. An average inverse top dimension top term ranking for the plurality of documents is further calculated based on the SVD factorization for the plurality of dimension counts. A number of dimensions is determined based on the average top dimension weight and the average inverse top dimension top term ranking. A topic model is built for the plurality of documents based on the number of dimensions. The topic model is adapted to identify patterns of terms that correspond to semantic topics in at least the plurality of documents.
机译:描述了用于分析文本的技术。实施例将多个文档标记为多个术语集。基于对多个维度计数执行奇异值分解(SVD)分解,计算与多个文档相对应的平均顶级维度权重。基于所述多个维数的SVD分解,进一步计算所述多个文档的平均逆前维顶级术语排名。基于平均顶部尺寸权重和平均顶部尺寸倒数顶级排名确定尺寸。基于维数为多个文档建立主题模型。主题模型适于识别与至少多个文档中的语义主题相对应的术语的模式。

著录项

  • 公开/公告号US2019347276A1

    专利类型

  • 公开/公告日2019-11-14

    原文格式PDF

  • 申请/专利权人 THE BOEING COMPANY;

    申请/专利号US201916524983

  • 申请日2019-07-29

  • 分类号G06F16/28;G06F16/2458;G06F16/36;G06F16/93;

  • 国家 US

  • 入库时间 2022-08-21 11:22:20

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号