...
首页> 外文期刊>Journal of Theoretical Biology >SGL-SVM: A novel method for tumor classification via support vector machine with sparse group Lasso
【24h】

SGL-SVM: A novel method for tumor classification via support vector machine with sparse group Lasso

机译:SGL-SVM:通过带稀疏组套索的支持向量机的肿瘤分类一种新方法

获取原文
获取原文并翻译 | 示例

摘要

At present, with the in-depth study of gene expression data, the significant role of tumor classification in clinical medicine has become more apparent. In particular, the sparse characteristics of gene expression data within and between groups. Therefore, this paper focuses on the study of tumor classification based on the sparsity characteristics of genes. On this basis, we propose a new method of tumor classification-Sparse Group Lasso (least absolute shrinkage and selection operator) and Support Vector Machine (SGL-SVM). Firstly, the primary selection of feature genes is performed on the normalized tumor datasets using the Kruskal-Wallis rank sum test. Secondly, using a sparse group Lasso for further selection, and finally, the support vector machine serves as a classifier for classification. We validate proposed method on microarray and NGS datasets respectively. Formerly, on three two-class and five multi-class microarray datasets it is tested by 10-fold cross-validation and compared with other three classifiers. SGL-SVM is then applied on BRCA and GBM datasets and tested by 5-fold cross-validation. Satisfactory accuracy is obtained by above experiments and compared with other proposed methods. The experimental results show that the proposed method achieves a higher classification accuracy and selects fewer feature genes, which can be widely applied in classification for high-dimensional and small-sample tumor datasets. The source code and all datasets are available at https://github.com/QUST-AIBBDRC/SGL-SVM/. (C) 2019 Elsevier Ltd. All rights reserved.
机译:目前,随着基因表达数据的深入研究,肿瘤分类在临床医学中的显着作用变得更加明显。特别地,基因表达数据和组之间基因表达数据的稀疏特征。因此,本文重点研究了基于基因稀疏特征的肿瘤分类研究。在此基础上,我们提出了一种新的肿瘤分类 - 稀疏组套索(最小绝对收缩和选择操作员)和支持向量机(SVM)。首先,使用kruskal-wallis等级和测试对归一化肿瘤数据集进行特征基因的主要选择。其次,使用稀疏组套索进行进一步选择,最后,支持向量机用作分类的分类器。我们分别在微阵列和NGS数据集上验证了提出的方法。以前,在三类和五个多级微阵列数据集上,它通过10倍交叉验证测试,并与其他三个分类器进行比较。然后将SGL-SVM应用于BRCA和GBM数据集,并通过5倍交叉验证测试。通过上述实验获得令人满意的精度,并与其他提出的方法进行比较。实验结果表明,该方法的分类精度达到了更高的分类精度,并选择了更少的特征基因,这可以广泛应用于高维和小样本肿瘤数据集的分类。源代码和所有数据集可用于https://github.com/qust-aibbdrc/sgl-svm/。 (c)2019年elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号