首页> 外文会议>International Conference on Theory and Practice of Digital Libraries >The Clustering-Based Initialization for Non-negative Matrix Factorization in the Feature Transformation of the High-Dimensional Text Categorization System: A Viewpoint of Term Vectors
【24h】

The Clustering-Based Initialization for Non-negative Matrix Factorization in the Feature Transformation of the High-Dimensional Text Categorization System: A Viewpoint of Term Vectors

机译:基于聚类的非负矩阵分解在高维文本分类系统的特征转换中的初始化:术语向量的观点

获取原文

摘要

Due to the non-negativity of the matrix factors, Non-negative Matrix Factorization (NMF) is favorable for transforming a high-dimensional original Terms-Documents matrix into a lower-dimensional semantic Concepts-Documents matrix in the text categorization. With the iterative nature of all NMF algorithms, the NMF matrix factors need initializing. In this paper, we propose a clustering-based method for initializing the NMF according to the term vectors instead of the document vectors as the previous researches.
机译:由于矩阵因子的非消极性,非负矩阵分解(NMF)有利于将高维原始项文件矩阵转换为文本分类中的低维语义概念 - 文档矩阵。随着所有NMF算法的迭代性质,NMF矩阵因子需要初始化。在本文中,我们提出了一种基于聚类的方法,用于根据术语向量初始化NMF而不是文件向量作为先前的研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号