...
首页> 外文期刊>Systems Engineering and Electronics, Journal of >Lazy learner text categorization algorithm based on embedded feature selection
【24h】

Lazy learner text categorization algorithm based on embedded feature selection

机译:基于嵌入式特征选择的懒惰学习者文本分类算法

获取原文
获取原文并翻译 | 示例
           

摘要

To avoid the curse of dimensionality, text categorization (TC) algorithms based on machine learning (ML) have to use an feature selection (FS) method to reduce the dimensionality of feature space. Although having been widely used, FS process will generally cause information losing and then have much side-effect on the whole performance of TC algorithms. On the basis of the sparsity characteristic of text vectors, a new TC algorithm based on lazy feature selection (LFS) is presented. As a new type of embedded feature selection approach, the LFS method can greatly reduce the dimension of features without any information losing, which can improve both efficiency and performance of algorithms greatly. The experiments show the new algorithm can simultaneously achieve much higher both performance and efficiency than some of other classical TC algorithms.
机译:为了避免维数的诅咒,基于机器学习(ML)的文本分类(TC)算法必须使用特征选择(FS)方法来减少特征空间的维数。尽管FS处理已被广泛使用,但通常会导致信息丢失,进而对TC算法的整体性能产生很大的负面影响。基于文本向量的稀疏性,提出了一种基于惰性特征选择(LFS)的TC算法。作为一种新型的嵌入式特征选择方法,LFS方法可以在不损失任何信息的情况下极大地减小特征的维数,从而可以大大提高算法的效率和性能。实验表明,与其他一些经典TC算法相比,新算法可以同时实现更高的性能和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号