A Multivariate Feature Selection Framework for High Dimensional Biomedical Data Classification

机译：用于高维生物医学数据分类的多变量特征选择框架

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

High dimensional biomedical data are becoming common in various predictive models developed for disease diagnosis and prognosis. Extracting knowledge from high dimensional data which contain a large number of features and a small sample size presents intrinsic challenges for classification models. Genetic Algorithms can be successfully adopted to efficiently search through high dimensional spaces, and multivariate classification methods can be utilized to evaluate combinations of features for constructing optimized predictive models. This paper proposes a framework which can be adopted for building prediction models for high dimensional biomedical data. The proposed framework comprises of three main phases. The feature filtering phase which filters out the noisy features; the feature selection phase which is based on multivariate machine learning techniques and the Genetic Algorithm to evaluate the filtered features and select the most informative subsets of features for achieving maximum classification performance; and the predictive modeling phase during which machine learning algorithms are trained on the selected features to construct a reliable prediction model. Experiments were conducted using four high dimensional biomedical datasets including protein and gene-expression data. The results revealed optimistic performances for the multivariate selection approaches which utilize classification measurements based on implicit assumptions.

机译：在为疾病诊断和预后开发的各种预测模型中，高维生物医学数据变得常见。从包含大量特征的高维数据中提取知识和小样本大小呈现出分类模型的内在挑战。可以成功采用遗传算法以有效地通过高维空间搜索，并且可以利用多变量分类方法来评估用于构建优化的预测模型的特征的组合。本文提出了一种框架，可用于构建高维生物医学数据的预测模型。所提出的框架包括三个主要阶段。功能过滤阶段滤除嘈杂功能;特征选择阶段基于多变量机器学习技术和遗传算法来评估过滤的功能，选择最大的功能子集，以实现最大分类性能;并且预测建模阶段在所选特征上培训机器学习算法以构建可靠的预测模型。使用包括蛋白质和基因表达数据的四个高尺寸生物医学数据集进行实验。结果揭示了利用基于隐含假设的分类测量的多变量选择方法的乐观性能。

著录项

来源
《IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology》|2017年|321p|共8页
会议地点
作者
Abeer Alzubaidi; Georgina Cosma;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 Q81-53;
关键词
Multivariate feature selection; Classification; Genetic Algorithm; High dimensional data; Biomedical data; Gaussian Naive Bayes; Linear Discriminant Analysis; Support Vector Machine; K-nearest Neighbor; Optimization;

机译：多变量特征选择;分类;遗传算法;高维数据;生物医学数据;高斯天真贝叶斯;线性判别分析;支持向量机;k最近邻居;优化;

相似文献

外文文献
中文文献
专利

1. A contemporary feature selection and classification framework for imbalanced biomedical datasets [J] . Thulasi Bikku, Sambasiva Rao Nandam, Ananda Rao Akepogu Egyptian Informatics Journal . 2018,第3期

机译：不平衡生物医学数据集的当代特征选择和分类框架
2. Feature Selection, Mutual Information, And The Classification Of High-dimensional Patternsapplications To Image Classification And Microarray Data Analysis [J] . Boyan Bonev, Francisco Escolano Miguel Cazorla Pattern Analysis and Applications . 2008,第3a4期

机译：特征选择，互信息和高维模式分类在图像分类和微阵列数据分析中的应用
3. A Novel Feature Selection Method for High-Dimensional Biomedical Data Based on an Improved Binary Clonal Flower Pollination Algorithm [J] . Yan Chaokun, Ma Jingjing, Luo Huimin, Human Heredity . 2019,第1期

机译：基于改进二元克隆花授粉算法的高维生物医学数据的新特征选择方法
4. A Multivariate Feature Selection Framework for High Dimensional Biomedical Data Classification [C] . Abeer Alzubaidi, Georgina Cosma IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology . 2017

机译：用于高维生物医学数据分类的多变量特征选择框架
5. Feature Selection and Classification for High-Dimensional Biological Data Under Cross-Validation Framework [D] . Zhong, Yi. 2018

机译：交叉验证框架下高维生物数据的特征选择与分类
6. Classification of high dimensional biomedical data based on feature selection using redundant removal [O] . Bingtao Zhang, Peng Cao 2012

机译：基于特征选择的冗余去除对高维生物医学数据进行分类
7. Dimensionality Reduction Techniques for Multivariate Data Classification, Interactive Visualization, and Analysis -- Systematic Feature Selection vs. Extraction [O] . Andreas König 2000

机译：多元数据分类，交互式可视化和分析的维数降低技术 - 系统特征选择与提取

A Multivariate Feature Selection Framework for High Dimensional Biomedical Data Classification

摘要

著录项

相似文献

相关主题

期刊订阅