...
首页> 外文期刊>The Analyst >Noise incorporated subwindow permutation analysis for informative gene selection using support vector machines
【24h】

Noise incorporated subwindow permutation analysis for informative gene selection using support vector machines

机译:使用支持向量机将包含噪声的子窗口置换分析用于信息丰富的基因选择

获取原文
获取原文并翻译 | 示例
           

摘要

Selecting a small subset of informative genes plays an important role in accurate prediction of clinical tumor samples. Based on model population analysis, a novel variable selection method, called noise incorporated subwindow permutation analysis (NISPA), is proposed in this study to work with support vector machines (SVMs). The essence of NISPA lies in the point that one noise variable is added into each sampled sub-dataset and then the distribution of variable importance of the added noise could be computed and serves as the common reference to evaluate the experimental variables. Further, by using the non-parametric Mann–Whitney U test, a P value can be assigned to each variable which describes to what extent the distributions of the gene variable and the noise variable are different. According to the computed P values, all the variables could be ranked and then a small subset of informative variables could be determined to build the model. Moreover, by NISPA, we are the first to distinguish the variables into a more detailed classification as informative, uninformative (noise) and interfering variables in comparison with other methods. In this study, two microarray datasets are employed to evaluate the performance of NISPA. The results show that the prediction errors of SVM classifiers could be significantly reduced by variable selection using NISPA. It is concluded that NISPA is a good alternative of variable selection algorithm.
机译:选择一小部分信息基因在准确预测临床肿瘤样本中起着重要作用。基于模型总体分析,本研究提出了一种新的变量选择方法,称为噪声合并子窗口置换分析(NISPA),以与支持向量机(SVM)一起使用。 NISPA的本质在于将一个噪声变量添加到每个采样的子数据集中,然后可以计算所添加噪声的变量重要性的分布,并可以用作评估实验变量的通用参考。此外,通过使用非参数Mann-Whitney U检验,可以为每个变量分配一个P值,该值描述了基因变量和噪声变量的分布差异程度。根据计算出的P值,可以对所有变量进行排序,然后可以确定信息变量的一小部分以建立模型。此外,通过NISPA,我们是第一个将变量与其他方法相比,进行更详细分类的分类,包括信息性,非信息性(噪声)和干扰性变量。在这项研究中,使用了两个微阵列数据集来评估NISPA的性能。结果表明,通过使用NISPA进行变量选择,可以显着减少SVM分类器的预测误差。结论是NISPA是变量选择算法的良好替代方案。

著录项

  • 来源
    《The Analyst》 |2011年第7期|p.1456-1463|共8页
  • 作者单位

    aResearch Center of Modernization of Traditional Chinese Medicines, College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, P. R. China. E-mail: yizeng_liang@263.netbSchool of Mathematic Sciences, Central South University, Changsha 410083, P. R. China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号