首页>
外国专利>
DIMENSION OPTIMIZATION IN SINGULAR VALUE DECOMPOSITION-BASED TOPIC MODELS
DIMENSION OPTIMIZATION IN SINGULAR VALUE DECOMPOSITION-BASED TOPIC MODELS
展开▼
机译:基于奇异值分解的主题模型中的维数优化
展开▼
页面导航
摘要
著录项
相似文献
摘要
Techniques are described for analyzing text. Embodiments tokenize a plurality of documents into a plurality of sets of terms. An average top dimension weight corresponding to the plurality of documents is calculated based on performing singular value decomposition (SVD) factorization for a plurality of dimension counts. An average inverse top dimension top term ranking for the plurality of documents is further calculated based on the SVD factorization for the plurality of dimension counts. A number of dimensions is determined based on the average top dimension weight and the average inverse top dimension top term ranking. A topic model is built for the plurality of documents based on the number of dimensions. The topic model is adapted to identify patterns of terms that correspond to semantic topics in at least the plurality of documents.
展开▼