首页> 外文会议>Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09 >A Text Classification Method with an Effective Feature Extraction Based on Category Analysis
【24h】

A Text Classification Method with an Effective Feature Extraction Based on Category Analysis

机译:基于类别分析的有效特征提取文本分类方法

获取原文

摘要

Text classification refers to determine the class of an unknown text according to its content in the given classification system. In order to extract fewer features to express the information in the text as much as possible, the paper analysis the various featuresȁ9; statistical properties and to extract the global features according to Zipf''s law; and then, based on the statistical analysis of the featuresȁ9; classified information, the efficient feature is extracted by computing the contribute of a feature; After that, the traditional TF-IDF formula is improved using category frequencies named by TF-IDF-CF for calculating the feature weight; Finally the text classification method is proposed. The experiment results illustrate that feature extraction methods proposed in the paper are effective and the formula TF-IDF-CF for calculating the feature weight has higher classification accuracy.
机译:文本分类是指在给定的分类系统中根据其内容确定未知文本的类别。为了尽可能少地提取特征以在文本中表达信息,本文对各种特征进行了分析[9]。统计特性,并根据Zipf定律提取全局特征;然后,基于特征ȁ9的统计分析;分类信息,通过计算特征的贡献来提取有效特征;此后,使用TF-IDF-CF命名的类别频率来计算特征权重,从而改进了传统的TF-IDF公​​式;最后提出了文本分类方法。实验结果表明,本文提出的特征提取方法是有效的,计算特征权重的公式TF-IDF-CF具有较高的分类精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号