首页> 外文期刊>IEEE/ACM transactions on computational biology and bioinformatics >Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection
【24h】

Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection

机译:有监督,无监督和半监督特征选择:基因选择综述

获取原文
获取原文并翻译 | 示例
           

摘要

Recently, feature selection and dimensionality reduction have become fundamental tools for many data mining tasks, especially for processing high-dimensional data such as gene expression microarray data. Gene expression microarray data comprises up to hundreds of thousands of features with relatively small sample size. Because learning algorithms usually do not work well with this kind of data, a challenge to reduce the data dimensionality arises. A huge number of gene selection are applied to select a subset of relevant features for model construction and to seek for better cancer classification performance. This paper presents the basic taxonomy of feature selection, and also reviews the state-of-the-art gene selection methods by grouping the literatures into three categories: supervised, unsupervised, and semi-supervised. The comparison of experimental results on top 5 representative gene expression datasets indicates that the classification accuracy of unsupervised and semi-supervised feature selection is competitive with supervised feature selection.
机译:最近,特征选择和降维已成为许多数据挖掘任务的基本工具,尤其是用于处理高维数据(如基因表达微阵列数据)的工具。基因表达微阵列数据包含多达数十万个特征,且样本量相对较小。由于学习算法通常不适用于此类数据,因此出现了降低数据维数的挑战。大量的基因选择被用于选择相关特征的子集以进行模型构建并寻求更好的癌症分类性能。本文介绍了特征选择的基本分类法,并通过将文献分为三类:有监督,无监督和半监督,回顾了最新的基因选择方法。在前5个代表性基因表达数据集上的实验结果进行比较表明,无监督和半监督特征选择的分类精度与监督特征选择具有竞争性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号