首页> 外文期刊>Wuhan University Journal of Natural Sciences >A New Feature Selection Method for Text Clustering
【24h】

A New Feature Selection Method for Text Clustering

机译:文本聚类的新特征选择方法

获取原文
获取原文并翻译 | 示例
       

摘要

Feature selection methods have been successfully applied to text categorization but seldom applied to text clusteringdue to the unavailability of class label information. In this paper, anew feature selection method for text clustering based on expectation maximization and cluster validity is proposed. It uses supervised feature selection method on the intermediate clustering resultwhich is generated during iterative clustering to do feature selection for text clustering; meanwhile, the Davies-Bouldin's index isused to evaluate the intermediate feature subsets indirectly. Thenfeature subsets are selected according to the curve of the DaviesBouldin's index. Experiment is carried out on several populardatasets and the results show the advantages of the proposedmethod.
机译:特征选择方法已经成功地应用于文本分类,但由于类标签信息的不可用,很少用于文本聚类。提出了一种基于期望最大化和聚类有效性的文本聚类特征选择方法。对迭代聚类过程中产生的中间聚类结果采用监督特征选择方法进行文本聚类的特征选择。同时,Davies-Bouldin指数用于间接评估中间特征子集。然后根据DaviesBouldin指数的曲线选择特征子集。在几种流行的数据集上进行了实验,结果表明了该方法的优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号