Invisible fence methods and the identification of differentially expressed gene sets

Jiming Jiang; Thuan Nguyen; J. Sunil Rao


Abstract

The fence method (Jiang et al. 2008; Ann. Statist. 36, 1669–1692) is a recently developed strategy for model selection. The idea involves a procedure to isolate a subgroup of what are known as correct models (of which the optimal model is a member). This is accomplished by constructing a statistical fence, or barrier, to carefully eliminate incorrect models. Once the fence is constructed, the optimal model is selected from amongst those within the fence according to a criterion which can be made flexible. The construction of the fence can be made adaptively to improve finite sample performance. We extend the fence method to situations where a true model may not exist or be among the candidate models. Furthermore, another look at the fence methods leads to a new procedure, known as invisible fence (IF). A fast algorithm is developed for IF in the case of subtractive measure of lack-of-fit. The main focus of the current paper is microarray gene-set analysis. In particular, Efron and Tibshirani (2007; Ann. Appl. Statist. 1, 107–129) proposed a gene set analysis (GSA) method based on testing the significance of gene-sets. In typical situations of microarray experiments the number of genes is much larger than the number of microarrays. This special feature presents a real challenge to implementation of IF to microarray gene-set analysis. We show how to solve this problem in this paper, and carry out an extensive Monte Carlo simulation study that compares the performances of IF and GSA in identifying differentially expressed gene-sets. The results show that IF outperforms GSA, in most cases significantly, uniformly across all the cases considered. Furthermore, we demonstrate both theoretically and empirically the consistency property of IF, while pointing out the inconsistency of GSA under certain situations. An application in tracking pathway involvement in late versus earlier stage colon cancers is considered.
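The basic fence idea summarized in the abstract (build a barrier that excludes poorly fitting models, then pick the simplest model inside it) can be illustrated with a small sketch. This is not the authors' implementation and it is not the invisible fence procedure. It assumes linear regression candidates, uses the residual sum of squares as the lack-of-fit measure, and replaces the data-driven fence width with a fixed, user-supplied `cutoff`. The names `rss`, `fence_select`, and `cutoff` are hypothetical.

```python
from itertools import combinations

import numpy as np


def rss(X, y, cols):
    """Residual sum of squares of the least-squares fit of y on the chosen columns."""
    if not cols:
        return float(y @ y)  # empty model: nothing is explained
    sub = X[:, list(cols)]
    beta, *_ = np.linalg.lstsq(sub, y, rcond=None)
    resid = y - sub @ beta
    return float(resid @ resid)


def fence_select(X, y, cutoff):
    """Pick the smallest candidate model whose lack-of-fit is within `cutoff` of the full model.

    Assumption: `cutoff` is a fixed constant chosen by the user; an adaptive
    fence would tune this width from the data instead.
    """
    p = X.shape[1]
    candidates = [c for k in range(p + 1) for c in combinations(range(p), k)]
    q = {c: rss(X, y, c) for c in candidates}
    q_full = q[tuple(range(p))]
    # Models inside the fence: lack-of-fit close enough to the best-fitting (full) model.
    inside = [c for c in candidates if q[c] - q_full <= cutoff]
    # Among those, prefer the fewest predictors, breaking ties by smaller lack-of-fit.
    return min(inside, key=lambda c: (len(c), q[c]))


# Toy data: only columns 0 and 2 carry signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = 2.0 * X[:, 0] - 3.0 * X[:, 2] + 0.1 * rng.normal(size=200)
selected = fence_select(X, y, cutoff=5.0)
```

Enumerating all subsets is only feasible for a handful of predictors; the abstract notes that gene-set applications, where genes far outnumber arrays, need a different approach.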


Bibliographic details

Source: Statistics and Its Interface, 2011, Issue 3, 13 pages
Authors: Jiming Jiang; Thuan Nguyen; J. Sunil Rao
Original format: PDF
Chinese Library Classification: Statistics
