首页> 美国卫生研究院文献>Journal of Computational Biology >Biomarker Discovery Using Statistically Significant Gene Sets
【2h】

Biomarker Discovery Using Statistically Significant Gene Sets

机译:使用具有统计学意义的基因集发现生物标记

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Analysis of large gene expression data sets in the presence and absence of a phenotype can lead to the selection of a group of genes serving as biomarkers jointly predicting the phenotype. Among gene selection methods, filter methods derived from ranked individual genes have been widely used in existing products for diagnosis and prognosis. Univariate filter approaches selecting genes individually, although computationally efficient, often ignore gene interactions inherent in the biological data. On the other hand, multivariate approaches selecting gene subsets are known to have a higher risk of selecting spurious gene subsets due to the overfitting of the vast number of gene subsets evaluated. Here we propose a framework of statistical significance tests for multivariate feature selection that can reduce the risk of selecting spurious gene subsets. Using three existing data sets, we show that our proposed approach is an essential step to identify such a gene set that is generated by a significant interaction of its members, even improving classification performance when compared to established approaches. This technique can be applied for the discovery of robust biomarkers for medical diagnosis.
机译:在存在和不存在表型的情况下对大型基因表达数据集进行分析可以导致选择一组基因作为共同预测该表型的生物标记。在基因选择方法中,源自排序的单个基因的过滤方法已广泛用于现有产品的诊断和预后。尽管计算效率高,但是单变量过滤器可单独选择基因,但通常会忽略生物学数据中固有的基因相互作用。另一方面,已知选择基因子集的多变量方法由于选择的大量基因子集过度拟合而具有较高的选择虚假基因子集的风险。在这里,我们提出了用于多元特征选择的统计显着性检验框架,可以减少选择虚假基因子集的风险。使用现有的三个数据集,我们证明了我们提出的方法是识别此类基因集的关键步骤,该基因集是由其成员之间的显着相互作用产生的,与已建立的方法相比,甚至可以提高分类性能。该技术可用于发现健壮的生物标志物以进行医学诊断。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号