首页> 外文会议>2010 International Conference on Computer Application and System Modeling >A mutual information and information entropy pair based feature selection method in text classification
【24h】

A mutual information and information entropy pair based feature selection method in text classification

机译:文本分类中基于互信息和信息熵对的特征选择方法

获取原文

摘要

Text classification is an important research field of data mining topics. This article brings a mutual information and information entropy pair based feature selection method (MIIEP_FS) based on the theory of information entropy and information entropy pair concept. This method measure the classification effect using feature by mutual information method and show the difference extent between the features being selected and the ones selected by information entropy. The experimental results show that the MIIEP_FS method proposed is more effective than MI and CHI methods. Macro F1 degrees of different kinds of machine learning algorithms: Naive Bayes and KNN method are higher by MIIEP_FS method, sometimes even more than the ones of support vector machines.
机译:文本分类是数据挖掘主题的重要研究领域。本文基于信息熵和信息熵对的概念,提出了一种基于信息和信息熵对的互斥特征选择方法(MIIEP_FS)。该方法利用互信息法利用特征来度量分类效果,并显示出被选择特征与信息熵所选择特征之间的差异程度。实验结果表明,提出的MIIEP_FS方法比MI和CHI方法更有效。不同种类的机器学习算法的宏F1度:MIIEP_FS方法的朴素贝叶斯和KNN方法更高,有时甚至比支持向量机更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号