首页> 中文期刊>计算机工程 >基于特征选择的质心向量构建方法

基于特征选择的质心向量构建方法

     

摘要

Text categorization method based on centroid shows poor performance. This paper proposes a centroid vector construction method based on feature selection named FSCC. By computing feature selection value between features and categories, the centroid vector are calculateed by the formula of centroid feature weight. Finally, a non-normalized cosine similarity measure is employed to calculate the similarity score between a text vector and a centroid. Experimental result show that FSCC significantly outperforms traditional centroid-based methods and state-of-the-art Support Vector Machine(SVM).%基于质心的文本分类方法对模型较敏感,分类性能较差.为此,提出一种基于特征选择的类别质心向量构建方法FSCC.计算特征与类别之间的特征选择值,利用质心特征权重计算公式得到类别的质心向量,并采用非归一化的余弦相似度计算文档与质心间的距离,实现文本分类.实验结果表明,与基于质心的方法和支持向量机方法相比,FSCC方法的分类效果更好.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号