首页> 外文会议>International Conference on Artificial Intelligence and Computational Intelligence >Dimensionality Reduction with Category Information Fusion and Non-negative Matrix Factorization for Text Categorization
【24h】

Dimensionality Reduction with Category Information Fusion and Non-negative Matrix Factorization for Text Categorization

机译:文本分类的类别信息融合和非负矩阵分解的维数减少

获取原文

摘要

Dimensionality reduction can efficiently improve computing performance of classifiers in text categorization, and non-negative matrix factorization could map the high dimensional term space into a low dimensional semantic subspace easily. Meanwhile, the non-negative of the basis vectors could provide a meaningful explanation for the semantic subspace. However, it usually could not achieve a satisfied classification performance because it is sensitive to the noise, data missing and outlier as a linear reconstruction method. This paper proposes a novel approach in which the train text and its category information are fused and a transformation matrix that maps the term space into a semantic subspace is obtained by a basis orthogonality non-negative matrix factorization and truncation. Finally, the dimensionality can be reduced aggressively with these transformations. Experimental results show that the proposed approach remains a good classification performance in a very low dimensional case.
机译:减少维度可以有效地改善文本分类中分类器的计算性能,并且非负矩阵分解可以容易地将高维术语空间映射到低维语义子空间。同时,基础向量的非负数可以为语义子空间提供有意义的解释。但是,它通常无法达到满意的分类性能,因为它对噪声,数据丢失和作为线性重建方法的异常敏感。本文提出了一种新的方法,其中列车文本及其类别信息被融合,并且将术语空间映射到语义子空间中的转换矩阵是通过基础正交非负矩阵分解和截断而获得的。最后,这些转换可以积极地减少维度。实验结果表明,该方法在非常低的尺寸情况下仍然是良好的分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号