...
首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Hybrid feature selection using micro genetic algorithm on microarray gene expression data
【24h】

Hybrid feature selection using micro genetic algorithm on microarray gene expression data

机译:使用微遗传算法在微阵列基因表达数据上的混合特征选择

获取原文
获取原文并翻译 | 示例
           

摘要

Research has proved that DNA Microarray data containing gene expression profiles are potentially excellent diagnostic tools in the medical industry. A persistent problem with regard to accessible microarray datasets is that the number of samples are much lesser than the number of features that are present. Thus, in order to extract accurate information from the dataset, one must use a robust technique. Feature selection (FS) has proved to be an effective way by which irrelevant and noisy data can be discarded. In FS, relevant features are picked, and result in commendable classification accuracy. This paper proposes a model that employs a compounded / hybrid feature selection technique (Filter + Wrapper) to classify microarray cancer data. Initially, a filter method called Information Gain (IG) to eliminate redundant features that will not contribute significantly to the final classification is used. Following to that, an evolutionary computing technique (micro Genetic Algorithm (mGA)) to find the best minimal subset of required features is employed. Then the features are classified using a traditional Support Vector Classifier and also cross validated to obtain high classification accuracy, using a minimal number of features. The complexity of the model is reduced significantly by adding mGA, as opposed to already existing models that use various other feature selection algorithms.
机译:研究证明,含有基因表达谱的DNA微阵列数据是医学行业的潜在优异的诊断工具。关于可访问的微阵列数据集的持续问题是样本的数量远小于存在的特征数量。因此,为了从数据集中提取准确的信息,必须使用鲁棒技术。特征选择(FS)已被证明是一种有效的方法,可以丢弃无关紧要和无关紧要的数据。在FS中,挑选相关功能,并导致值得称道的分类准确性。本文提出了一种采用复合/混合特征选择技术(过滤器+包装物)来分类微阵列癌症数据的模型。最初,使用称为信息增益(IG)的滤波器方法,以消除对最终分类不会显着贡献的冗余功能。接下来,采用进化计算技术(微遗传算法(MGA))以找到所需特征的最佳最小值的子集。然后,使用传统的支持向量分类器分类,并且还使用最小数量的功能来分类为获得高分类精度的验证。通过添加MGA,模型的复杂性显着降低,而不是使用各种其他特征选择算法的现有模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号