首页> 外文会议>2010 Second WRI Global Congress on Intelligent Systems >A Method of Text Feature Extraction Based on Weighted Scatter Difference
【24h】

A Method of Text Feature Extraction Based on Weighted Scatter Difference

机译:基于加权分散差的文本特征提取方法

获取原文

摘要

Feature reduction is one of the core technologies of automatic text categorization. As for the scatter difference criterion, poor categorization effect is made when the between-class distance is small and the class density is high. In order to solve this problem, a weighted method based on the sample distribution is shown in the paper, which will make the between-class and within-class scatter matrixes with poor scatter be weighted, to enhance the categorization ability after dimensional reduction and to improve the dimensional reduction effect of linear feature extraction method based on scatter difference. The following experiment tells us that this method is superior to the original maximum scatter difference method in precision rate and recall rate.
机译:特征缩减是自动文本分类的核心技术之一。至于散布差异标准,当类间距离小和类密度高时,分类效果差。为了解决这个问题,本文提出了一种基于样本分布的加权方法,该方法可以对散布不良的类间散布矩阵和类内散布矩阵进行加权,以提高降维后的分类能力。提高了基于散度差的线性特征提取方法的降维效果。下面的实验告诉我们,该方法在准确率和查全率方面优于原始的最大散射差法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号