...
首页> 外文期刊>Information Technology Journal >Kernel Sparse Feature Selection Based on Semantics in Text Classification
【24h】

Kernel Sparse Feature Selection Based on Semantics in Text Classification

机译:基于语义的文本分类中的核稀疏特征选择

获取原文
获取原文并翻译 | 示例
           

摘要

Sparse representation originating from signal compressed sensing theory has attracted increasing interest in computer vision research community. In this paper, we present a novel non-parametric feature selection method based on sparse representation in text classification. In order to solve the problem of polysems and synonyms in VSM, we construct semantic structure to represent document with PLSA. Motivated by the fact that kernel trick can capture the nonlinear similarity of features, which may reduce the feature quantization error, we propose Empirical Kernel Sparse Representation (EKSR). We apply EKSR to reconstruct weight vector between samples, then design evaluating mechanism CKernel Sparsity Score (KSS) to select excellent feature subset. As the natural discriminative power of EKSR, KSS can find Agood@ feature which preserves the original structure with less information loss. The results of experiment both on English and Chinese dataset demonstrate the effectiveness of the proposed method.
机译:源自信号压缩传感理论的稀疏表示法引起了计算机视觉研究界的越来越多的关注。在本文中,我们提出了一种基于稀疏表示的文本分类中的非参数特征选择方法。为了解决VSM中的多义词和同义词问题,我们构造了语义结构以PLSA表示文档。由于内核技巧可以捕获特征的非线性相似性,从而可以减少特征量化误差,因此,我们提出了经验核稀疏表示(EKSR)。我们应用EKSR重建样本之间的权重向量,然后设计评估机制CKernel Sparsity Score(KSS)来选择优秀的特征子集。作为EKSR的自然判别力,KSS可以找到Agood @功能,该功能可以保留原始结构并减少信息丢失。在英汉数据集上的实验结果证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号