首页> 外文会议>International Conference on Computing Methodologies and Communication >Score-Based Feature Selection of Gene expression Data for Cancer Classification
【24h】

Score-Based Feature Selection of Gene expression Data for Cancer Classification

机译:基于得分的基因表达数据特征选择用于癌症分类

获取原文

摘要

Feature selection in machine learning can also be specified as attribute selection. It is a process of selection desired feature from a large amount of data set. A typical microarray data set has basic properties such as high-dimensionality and limited sample, which makes it less accurate for classification and also time-consuming. In order to increase the accuracy of the classification, we have to decrease the dimensionality of the dataset. To achieve this, there are two feature elimination methods namely, feature selection and feature extraction. The proposed study focuses on the filter-based feature selection method. The main aim of the proposed work is to decrease the computation time and increase the accuracy of classification and prediction. To achieve this, he proposed work reduces the dimensionality of data set and also the redundancy between various features. Several feature selection methods exist but most of them have increased computational time, so here we are using score-based criteria fusion method for feature selection, which improves the prediction accuracy and decreases the computational time.
机译:机器学习中的特征选择也可以指定为属性选择。这是从大量数据集中选择所需特征的过程。典型的微阵列数据集具有基本属性,例如高维和有限的样本,这使其分类的准确性降低,并且耗时。为了提高分类的准确性,我们必须降低数据集的维数。为此,有两种特征消除方法,即特征选择和特征提取。拟议的研究集中在基于过滤器的特征选择方法上。拟议工作的主要目的是减少计算时间,提高分类和预测的准确性。为了实现这一点,他提出了减少数据集的维数以及减少各种功能之间的冗余的建议。存在几种特征选择方法,但是大多数特征选择方法增加了计算时间,因此在这里我们使用基于分数的标准融合方法进行特征选择,这提高了预测精度并减少了计算时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号