...
首页> 外文期刊>Scientific reports. >A Deep Neural Network Model using Random Forest to Extract Feature Representation for Gene Expression Data Classification
【24h】

A Deep Neural Network Model using Random Forest to Extract Feature Representation for Gene Expression Data Classification

机译:利用随机森林提取特征表示的深度神经网络模型,用于基因表达数据分类

获取原文
   

获取外文期刊封面封底 >>

       

摘要

In predictive model development, gene expression data is associated with the unique challenge that the number of samples (n) is much smaller than the amount of features (p). This “n???p” property has prevented classification of gene expression data from deep learning techniques, which have been proved powerful under “n??p” scenarios in other application fields, such as image classification. Further, the sparsity of effective features with unknown correlation structures in gene expression profiles brings more challenges for classification tasks. To tackle these problems, we propose a newly developed classifier named Forest Deep Neural Network (fDNN), to integrate the deep neural network architecture with a supervised forest feature detector. Using this built-in feature detector, the method is able to learn sparse feature representations and feed the representations into a neural network to mitigate the overfitting problem. Simulation experiments and real data analyses using two RNA-seq expression datasets are conducted to evaluate fDNN’s capability. The method is demonstrated a useful addition to current predictive models with better classification performance and more meaningful selected features compared to ordinary random forests and deep neural networks.
机译:在预测模型开发中,基因表达数据与唯一的挑战相关,即样本数量(n)远小于特征数量(p)。这种“ n ??? p”属性阻止了来自深度学习技术的基因表达数据分类,在深度学习技术中,在图像分类等其他应用领域的“n≥p”场景下,事实证明这种功能很强大。此外,基因表达谱中具有未知相关结构的有效特征的稀疏性给分类任务带来了更多挑战。为了解决这些问题,我们提出了一种新开发的分类器,称为森林深层神经网络(fDNN),以将深层神经网络架构与监督的森林特征检测器集成在一起。使用此内置特征检测器,该方法能够学习稀疏特征表示并将这些表示馈入神经网络以减轻过拟合问题。使用两个RNA-seq表达数据集进行了模拟实验和真实数据分析,以评估fDNN的功能。与普通的随机森林和深度神经网络相比,该方法被证明是当前预测模型的有用补充,具有更好的分类性能和更有意义的选择特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号