
Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods


Abstract

Gene set methods aim to assess the overall evidence of association between a set of genes and a phenotype, such as a disease or a quantitative trait. Multiple approaches for gene set analysis of expression data have been proposed. They can be divided into two types: competitive and self-contained. Self-contained methods can be used for genome-wide, candidate gene, or pathway studies, and have been reported to be more powerful than competitive methods. We therefore investigated ten self-contained methods that can be used for continuous, discrete, and time-to-event phenotypes. To assess the power and type I error rate of the previously proposed and novel approaches, we completed an extensive simulation study in which the scenarios varied according to the number of genes in a gene set, the number of genes associated with the phenotype, the effect sizes, the correlation between expression of genes within a gene set, and the sample size. In addition to the simulated data, the methods were applied to a pharmacogenomic study of the drug gemcitabine. Simulation results demonstrated that, overall, Fisher's method and the global model with random effects have the highest power across a wide range of scenarios, while the analyses based on the first principal component and on the Kolmogorov-Smirnov test tended to have the lowest power. The methods investigated here are likely to play an important role in identifying pathways that contribute to complex traits.
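To make the idea behind Fisher's method concrete, the following is a minimal sketch of how per-gene p-values might be combined into a single gene-set statistic. It is not taken from the paper: the function name `fisher_combined_pvalue` is hypothetical, and the chi-square reference distribution used here assumes the per-gene p-values are independent. Expression levels of genes within a set are often correlated, so in practice a permutation-based null distribution may be more appropriate than the closed-form p-value shown.

```python
import math


def fisher_combined_pvalue(p_values):
    """Combine per-gene p-values with Fisher's method.

    Hypothetical helper for illustration only. Assumes the p-values are
    independent, so the statistic follows a chi-square distribution with
    2k degrees of freedom under the null hypothesis.
    """
    if not p_values:
        raise ValueError("need at least one p-value")
    if any(not (0.0 < p <= 1.0) for p in p_values):
        raise ValueError("p-values must lie in (0, 1]")

    k = len(p_values)
    # Fisher's statistic: X = -2 * sum(log p_i).
    stat = -2.0 * sum(math.log(p) for p in p_values)

    # Chi-square survival function for even degrees of freedom 2k:
    # P(X > x) = exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!
    half = stat / 2.0
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half / i
        total += term
    return stat, math.exp(-half) * total
```

Calling `fisher_combined_pvalue([0.05, 0.05])` returns the statistic together with a combined p-value smaller than either input, which illustrates how several moderately small per-gene p-values can add up to stronger evidence at the gene-set level.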
