首页> 外文会议>International Conference on Data Science and Business Analytics >Research on Disease Classification Model and Algorithms Based on Gene Expression Data
【24h】

Research on Disease Classification Model and Algorithms Based on Gene Expression Data

机译:基于基因表达数据的疾病分类模型和算法研究

获取原文

摘要

High dimension, small sample size of gene expression data lead a great deal of difficulty to disease classification, in-depth model and algorithm research is carried out to solve this problem. Firstly, a linear combination model of weak classifier is constructed by boosting method and the feature subset is selected by removing the zero-weight feature genes in the boosting method. Then, three classification methods, boosting method, SVM and K-nearest neighbor are integrated to learn in order to improve the accuracy of the classification model. Finally, the classification model of ensemble learning is applied in colon cancer dataset. Rather than a single classification model, ensemble method can reduce dimension of data and obtain higher accuracy shown by the experimental results.
机译:高维,基因表达数据的小样本大小引发了疾病分类的大量困难,深入模型和算法研究进行了解决这个问题。首先,通过升压方法构造弱分类器的线性组合模型,通过在升压方法中去除零权重特征基因来选择特征子集。然后,集成了三种分类方法,提升方法,SVM和K最近邻居以学习以提高分类模型的准确性。最后,在结肠癌数据集中应用了集合学习的分类模型。集合方法而不是单个分类模型,可以减少数据的维度并获得实验结果所示的更高精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号