首页> 外文学位 >A housekeeping gene-based procedure for the selection of differentially expressed genes for Affymetrix microarray experiments.
【24h】

A housekeeping gene-based procedure for the selection of differentially expressed genes for Affymetrix microarray experiments.

机译:基于管家基因的程序,用于选择Affymetrix微阵列实验的差异表达基因。

获取原文
获取原文并翻译 | 示例

摘要

This thesis is concerned with modeling and analyzing experimental Affymetrix gene expression microarray data. For the analysis part, we propose a cross-gene-rank procedure for the detection of differentially expressed genes for Affymetrix microarray experiments. This procedure is based on a comprehensive exploration of housekeeping genes with the objective of classifying them (Eisenberg and Levanon (2003)) into "good" (non-differentially expressed) and "bad" (differentially expressed) genes using the non-parametric classification method based on cross-gene ranks developed in this work.;The advantages of the cross-gene rank procedure in the context of microarray gene expressions are as follows: Testing the null hypothesis that the expression measurements of an individual gene have identical means across treatments is reduced to test the null hypothesis that its cross-gene ranks of expression measurements are constant across treatments, given we know that most of the genes are non-differential genes; Instead of taking the genes independently from each other as in the model-based inferences and the traditional settings of rank statistcs, we put an individual gene in the context of other genes by considering it's position among them. In this manner, the cross-gene rank and its rank statistic are valuable expansions of currently available inferences. It puts an expression measurement at the cross point of a gene and a treatment in a grid composed of treatments as columns and genes as rows.;For the modeling part, models for expression measurements of non-differentially expressed "good" housekeeping genes are developed, where the expression measurements follow a truncated normal distribution with mean-dependent variances that follow "asymptotic" regression models. The variances depend on both the gene expression level and the experimental conditions. The variation in expression measurements of "good" housekeeping genes captured by the models includes both the observed variation within a single treatment and between a treatment and the control. Proper modeling of the dependence of the variance on the mean expression level help improve the estimation of the variation in gene expression when only a small number of replicates are available for the analysis.;Further, comparisons across a treatment and the control are analyzed using fold change as a statistical measure for all pairwise comparisons. Null confidence bounds of the fold change statistic are determined for all genes.;Our simulation study shows that our procedure has lower error rate than the constant confidence bounds procedure which is popularly used to provide comparative data by investigators using microarrays. Our procedure also greatly increases the yield of statistically significant genes delivered by our new analysis methods.;Key words: Affymetrix gene expression microarray, housekeeping genes, cross-gene rank, linear model, truncated normal distribution, asymptotic variance model, fold change (ratio), confidence bound.
机译:本文涉及对Affymetrix基因表达微阵列实验数据的建模和分析。对于分析部分,我们提出了一种跨基因秩程序,用于为Affymetrix微阵列实验检测差异表达的基因。该程序基于对管家基因的全面探索,目的是使用非参数分类将它们(Eisenberg和Levanon(2003))分类为“好”(非差异表达)和“坏”(差异表达)基因。在这项工作中开发了基于交叉基因等级的方法。;在微阵列基因表达的背景下,交叉基因等级程序的优点如下:测试零假设,即单个基因的表达量度在治疗中具有相同的手段如果我们知道大多数基因是非差异基因,则将其简化以检验零假设,即其跨基因表达测量等级在治疗之间是恒定的。与其像基于模型的推论和等级统计的传统设置那样,使基因彼此独立,我们不考虑其在其他基因中的位置,而是将其置于其他基因的上下文中。以这种方式,交叉基因等级及其等级统计是当前可用推论的有价值的扩展。它在一个基因的交点处放置一个表达量,并在一个由处理作为列,基因作为行组成的网格中放置一个处理。;对于建模部分,开发了用于非差异表达“良好”管家基因表达测量的模型。 ,其中表达量度遵循截尾正态分布,均值相关方差遵循“渐近”回归模型。差异取决于基因表达水平和实验条件。由模型捕获的“良好”持家基因的表达测量值的变化既包括在单一处理中以及在处理与对照之间观察到的变化。当只有少量重复可用于分析时,对方差对平均表达水平的依赖性进行正确建模有助于改善基因表达变异的估计。此外,使用倍数分析处理与对照之间的比较变化作为所有成对比较的统计量度。对所有基因都确定了倍数变化统计的零置信区间。我们的模拟研究表明,与使用微阵列的研究者普遍使用的提供比较数据的恒定置信区间方法相比,我们的程序具有更低的错误率。我们的程序还极大地提高了通过我们的新分析方法提供的具有统计意义的基因的产量。关键词:Affymetrix基因表达微阵列,管家基因,交叉基因秩,线性模型,正态截断,渐近方差模型,倍数变化(比率) ),信心有限。

著录项

  • 作者

    Dong, Chunrong.;

  • 作者单位

    Case Western Reserve University.;

  • 授予单位 Case Western Reserve University.;
  • 学科 Biology Bioinformatics.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 229 p.
  • 总页数 229
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号