首页> 外文期刊>International journal of computer science and network security >A Novel Feature Selection Method Based on Category Information Analysis for Class Prejudging in Text Classification
【24h】

A Novel Feature Selection Method Based on Category Information Analysis for Class Prejudging in Text Classification

机译:一种基于类别信息分析的新型特征选择方法,用于文本分类中的类偏见

获取原文
       

摘要

This paper presents a new feature selection algorithm with the category information analysis in text classification. The algorithm obscure or reduce the noises of text features by computing the feature contribution with word and document frequency and introducing variance mechanism to mine the latent category information. The algorithm is distinguished from others by providing a pre-fetching technique for classifier while it is compatible with efficient feature selection, which means that the classifier can actively prejudge the candidate class labels to unseen documents using the category information linked to features and classify them in the candidate class space to retrench time expenses. The experimental results on Chinese and English corpus show that the algorithm achieves a high performance. The F measure is 0.73 and 0.93 respectively and the run efficiency of classifier is improved greatly.
机译:本文提出了一种新的特征选择算法,文本分类中的类别信息分析。该算法通过计算单词和文档频率的功能贡献并引入潜在类别信息来掩盖或减少文本特征的噪声。通过提供用于分类的预取技术,在与有效的特征选择兼容的情况下,该算法与其他算法区分开,这意味着分类器可以使用链接到特征的类别信息来主动将候选类标签用于解除文档,并将其分类候选人级空间来追击时间费用。中英文语料库的实验结果表明,该算法实现了高性能。 F度量分别为0.73和0.93,分类器的运行效率大大提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号