The feature selection bias problem in relation to high-dimensional gene data

Krawczuk Jerzy; Lukaszuk Tomasz

首页> 外文期刊>Artificial intelligence in medicine >The feature selection bias problem in relation to high-dimensional gene data

【24h】

The feature selection bias problem in relation to high-dimensional gene data

机译：与高维基因数据有关的特征选择偏差问题

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Objective: Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this paper, we consider feature selection for the classification of gene datasets. Gene data is usually composed of just a few dozen objects described by thousands of features. For this kind of data, it is easy to find a model that fits the learning data. However, it is not easy to find one that will simultaneously evaluate new data equally well as learning data. This overfitting issue is well known as regards classification and regression, but it also applies to feature selection.

机译：目的：特征选择是一种广泛用于数据挖掘的技术。目的是选择与所考虑问题相关的最佳特征子集。在本文中，我们考虑将特征选择用于基因数据集的分类。基因数据通常仅由几十个以数千种功能描述的对象组成。对于此类数据，很容易找到适合学习数据的模型。但是，要找到一个可以同时评估新数据和学习数据的方法并不容易。这个过拟合问题在分类和回归方面是众所周知的，但它也适用于特征选择。

著录项

来源
《Artificial intelligence in medicine》 |2016年第1期|63-71|共9页
作者
Krawczuk Jerzy; Lukaszuk Tomasz;
展开▼
作者单位

Bialystok Tech Univ, Fac Comp Sci, 45A Wiejska St, PL-15351 Bialystok, Poland;

Bialystok Tech Univ, Fac Comp Sci, 45A Wiejska St, PL-15351 Bialystok, Poland;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Feature selection bias; Convex and piecewise linear classifier; Support vector machine; Gene selection; Microarray data;

机译：特征选择偏差;凸和分段线性分类器;支持向量机;基因选择;芯片数据;
入库时间 2022-08-18 03:47:20

相似文献

外文文献
中文文献
专利

1. Investigation on particle swarm optimisation for feature selection on high-dimensional data: local search and selection bias [J] . Binh Tran, Xue Bing, Zhang Mengjie, Connection Science . 2016,第3期

机译：用于高维数据特征选择的粒子群算法研究：局部搜索和选择偏差
2. Survival analysis for high-dimensional, heterogeneous medical data: Exploring feature extraction as an alternative to feature selection [J] . Poelsterl Sebastian, Conjeti Sailesh, Navab Nassir, Artificial intelligence in medicine . 2016,第sepa期

机译：高维，异构医学数据的生存分析：探索特征提取作为特征选择的替代方法
3. A novel feature selection scheme for high-dimensional data sets: four-Staged Feature Selection [J] . Pehlivanli Ayca Cakmak Journal of applied statistics . 2016,第5a8期

机译：高维数据集的新颖特征选择方案：四阶段特征选择
4. Genetic feature selection combined with composite fuzzy nearest neighbor classifiers for high-dimensional remote sensing data [C] . Yu, S., De Backer, . 2000

机译：遗传特征选择与复合模糊最近邻分类器相结合的高维遥感数据
5. Novel Metrics and Theoretical Properties of Nearest-Neighbor Distance-Based Feature Selection in High-Dimensional Bioinformatics Data [D] . Dawkins, Bryan A. 2020

机译：高维生物信息学数据中最近邻距离的特征选择的新特性和理论特性
6. Comparison of Methods for Feature Selection in Clustering of High-Dimensional RNA-Sequencing Data to Identify Cancer Subtypes [O] . David Källberg, Linda Vidman, Patrik Rydén 2021

机译：高尺寸RNA测序数据聚类特征选择方法的比较识别癌症亚型
7. TAGA: Tabu Asexual Genetic Algorithm embedded in a filter/filter feature selection approach for high-dimensional data [O] . Sadegh Salesi, Georgina Cosma, Michalis Mavrovouniotis 2021

机译：塔加：禁忌无形遗传算法嵌入过滤器/过滤器特征选择方法，用于高维数据

The feature selection bias problem in relation to high-dimensional gene data

摘要

著录项

相似文献

相关主题

期刊订阅