首页> 外国专利> DIMENSION OPTIMIZATION IN SINGULAR VALUE DECOMPOSITION-BASED TOPIC MODELS

DIMENSION OPTIMIZATION IN SINGULAR VALUE DECOMPOSITION-BASED TOPIC MODELS

机译:基于奇异值分解的主题模型中的维数优化

摘要

Techniques are described for optimizing a number of dimensions for performing a singular value decomposition (SVD) factorization. Embodiments tokenize each of a plurality of documents into a respective set of terms. For each of a plurality of dimension counts, embodiments perform the SVD factorization to determine a respective plurality of dimensions, the respective plurality of dimensions corresponding to the dimension count, determine, for each of the plurality of documents, a respective set of dimension weights for each of the plurality of dimensions, calculate an average top dimension weight across the sets of dimension weights for the plurality of documents and calculate an average inverse top dimension top term ranking across the sets of dimension weights for the plurality of documents. An optimal number of dimensions is calculated, based on the average top dimension weight and the average inverse top dimension top term ranking.
机译:描述了用于优化多个维度以执行奇异值分解(SVD)分解的技术。实施例将多个文档中的每一个标记成相应的术语集。对于多个维度计数中的每一个,实施例执行SVD分解以确定相应的多个维度,与该维度计数相对应的相应的多个维度,针对多个文档中的每一个,确定用于多个维度中的每个维度,针对多个文档的维度权重集计算平均顶级维度权重,并针对多个文档的维度权重集计算平均逆维度顶级词条排名。根据平均顶部尺寸权重和平均顶部尺寸倒数排名,计算出最佳尺寸。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号