首页> 外文期刊>Information Processing & Management >Supervised Hebb rule based feature selection for text classification
【24h】

Supervised Hebb rule based feature selection for text classification

机译:基于监督Hebb规则的文本分类特征选择

获取原文
获取原文并翻译 | 示例
       

摘要

Text documents usually contain high dimensional non-discriminative (irrelevant and noisy) terms which lead to steep computational costs and poor learning performance of text classification. One of the effective solutions for this problem is feature selection which aims to identify discriminative terms from text data. This paper proposes a method termed “Hebb rule based feature selection (HRFS)”. HRFS is based on supervised Hebb rule and assumes that terms and classes are neurons and select terms under the assumption that a term is discriminative if it keeps “exciting” the corresponding classes. This assumption can be explained as “a term is highly correlated with a class if it is able to keep “exciting” the class according to the original Hebb postulate. Six benchmarking datasets are used to compare HRFS with other seven feature selection methods. Experimental results indicate that HRFS is effective to achieve better performance than the compared methods. HRFS can identify discriminative terms in the view of synapse between neurons. Moreover, HRFS is also efficient because it can be described in the view of matrix operation to decrease complexity of feature selection.
机译:文本文档通常包含高维的非歧视性(无关且嘈杂)的术语,这会导致较高的计算成本和较差的文本分类学习性能。解决此问题的有效方法之一是特征选择,其目的是从文本数据中识别出有区别的术语。本文提出了一种称为“基于Hebb规则的特征选择(HRFS)”的方法。 HRFS基于监督的Hebb规则,并假设术语和类别是神经元,并在假设术语保持“激发”相应类别的前提下具有歧视性的前提下选择术语。这个假设可以解释为“如果一个词能够按照原始的赫布假设保持“激发”该类,则它与该类高度相关。六个基准数据集用于将HRFS与其他七个特征选择方法进行比较。实验结果表明,与比较方法相比,HRFS可有效实现更好的性能。 HRFS可以从神经元之间的突触角度识别歧视性术语。此外,HRFS也是有效的,因为可以从矩阵运算的角度对其进行描述以降低特征选择的复杂性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号