...
首页> 外文期刊>Procedia Computer Science >Feature Selection and Classification of Microarray Data using MapReduce based ANOVA and K-Nearest Neighbor
【24h】

Feature Selection and Classification of Microarray Data using MapReduce based ANOVA and K-Nearest Neighbor

机译:使用基于MapReduce的ANOVA和K最近邻对微阵列数据进行特征选择和分类

获取原文

摘要

The major drawback of microarray data is the ‘curse of dimensionality problem’, this hinders the useful information of dataset and leads to computational instability. Therefore, selecting relevant genes is an imperative in microarray data analysis. Most of the existing schemes employ a two-phase processes: feature selection/extraction followed by classification. In this paper, a statistical test, ANOVA based on MapReduce is proposed to select the relevant features. After feature selection, MapReduce based K-Nearest Neighbor (K-NN) classifier is also proposed to classify the microarray data. These algorithms are successfully implemented on Hadoop framework and comparative analysis is done using various datasets.
机译:微阵列数据的主要缺点是“维数问题”,这阻碍了数据集的有用信息并导致计算不稳定。因此,选择相关基因在微阵列数据分析中势在必行。大多数现有方案采用两个阶段的过程:特征选择/提取,然后进行分类。本文提出了一种基于MapReduce的统计检验ANOVA来选择相关特征。在特征选择之后,还提出了基于MapReduce的K最近邻(K-NN)分类器对微阵列数据进行分类。这些算法已在Hadoop框架上成功实现,并且使用各种数据集进行了比较分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号