首页> 外文期刊>Engineering Applications of Artificial Intelligence >Global Filter-Wrapper method based on class-dependent correlation for text classification
【24h】

Global Filter-Wrapper method based on class-dependent correlation for text classification

机译:基于类相关的全局Filter-Wrapper方法用于文本分类

获取原文
获取原文并翻译 | 示例
           

摘要

Text classification suffers from the high dimensionality and sparseness of the feature space. Feature selection (FS) is known as an important stage of the pre-processing phase. Most recently, Point-wise Mutual Information (PMI), a common concept in information theory, has been used as an effective and widely adapted approach for FS. The FS method proposed in this paper uses a hybrid approach to propose a new global PMI-based FS method. In it, the advantages of both the filter approach and the wrapper approach are combined in a different way. In the first phase, the ranking-based filter approach is implemented for FS by applying the information gain method. In the second phase, the subset selection-based filter approach is implemented for FS by introducing the global PMI-based FS method designed based on two basic principles: (1) class-dependent assumption for computing the correlation between pairs of features, and (2) an embedded wrapper approach. The results showed that the hybrid proposed method can produce better results than the state-of-the-art FS methods in both classification performance and dimension reduction.
机译:文本分类遭受特征空间的高维和稀疏性的困扰。功能选择(FS)被称为预处理阶段的重要阶段。最近,信息理论中的常用概念逐点相互信息(PMI)已被用作FS的一种有效且广泛适用的方法。本文提出的FS方法使用一种混合方法来提出一种新的基于PMI的全局FS方法。其中,过滤器方法和包装器方法的优点以不同的方式结合在一起。在第一阶段,通过应用信息增益方法为FS实现基于排名的过滤器方法。在第二阶段,通过引入基于两个基本原理设计的基于全局PMI的FS方法,为FS实现基于子集选择的过滤器方法:(1)用于计算特征对之间相关性的类相关假设;以及( 2)嵌入式包装方法。结果表明,在分类性能和降维方面,提出的混合方法都可以比最新的FS方法产生更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号