首页> 中文期刊> 《系统工程与电子技术:英文版》 >Lazy learner text categorization algorithm based on embedded feature selection

Lazy learner text categorization algorithm based on embedded feature selection

         

摘要

To avoid the curse of dimensionality,text categorization(TC)algorithms based on machine learning (ML)have to use an feature selection(FS)method to reduce the dimensionality of feature space.Although having been widely used,FS process will generally cause information losing and then have much side-effect on the whole performance of TC algorithms.On the basis of the sparsity characteristic of text vectors,a new TC algorithm based on lazy feature selection(LFS)is presented.As a new type of embedded feature selection approach,the LFS method can greatly reduce the dimension of features without any information losing,which can improve both efficiency and performance of algorithms greatly.The experiments show the new algorithm can simultaneously achieve much higher both performance and efficiency than some of other classical TC algorithms.

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号