首页> 外文学位 >Statistical hypothesis testing and application to biological data.
【24h】

Statistical hypothesis testing and application to biological data.

机译:统计假设检验并将其应用于生物学数据。

获取原文
获取原文并翻译 | 示例

摘要

Recent developments in biological research, for instance genomics and proteomics, have created new statistical challenges for scientists. The notion of hypothesis testing is of utmost importance when determining genes which arc, for example, differentially expressed between two disease states. Using a multiple testing technique which accurately controls the false positive rate with adequate power is vital. This doctoral dissertation is focused on statistical hypothesis testing and is broken up into seven chapters.; Chapter 1 focuses on the concept of hypothesis testing and in particular provides a brief outline of multiple testing and the existing methodologies. Chapter 2 reviews a variety of multiple testing methods. In particular, this chapter will focus on the Pollard and van der Laan (2003) resampling based multiple testing method including a comparison to existing procedures.; Chapter 3 describes the newly proposed Empirical Bayes/TPPFP procedure, which has been found to be more powerful and less conservative as compared to existing multiple testing procedures, based on several simulation results.; The next two chapters illustrate an application of the multiple testing procedure on two biological datasets. Chapter 4 describes an HIV-1 dataset which consists of reverse transcriptase and protease codon positions and an outcome of replication capacity. The goal of this analysis is to determine which codon mutations are univariately associated with viral replication. Chapter 5 describes a proteomic dataset which consists of mass spectrometry data from both acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL) patients. The goal of this analysis is to correctly preprocess the spectral data and subsequently apply the multiple testing procedures to determine which proteins are differentially expressed between the two leukemia subtypes.; A general hypothesis technique is described and explored in Chapter 6. This general pathway testing procedure is applied to the previously described HIV-1 dataset. This procedure will determine the significance of the specific model, which represents the relationship between viral codons (protease and reverse transcriptase) and replication capacity. Finally, Chapter 7 provides a conclusion and summary of the preceding chapters.
机译:生物学研究的最新进展,例如基因组学和蛋白质组学,为科学家带来了新的统计挑战。当确定例如在两种疾病状态之间差异表达的基因时,假设检验的概念至关重要。使用多种测试技术以足够的功率准确控制误报率至关重要。该博士论文侧重于统计假设检验,分为七个章节。第1章着重于假设检验的概念,特别是简要介绍了多重检验和现有方法。第2章回顾了多种测试方法。特别是,本章将重点介绍基于Pollard和van der Laan(2003)重采样的多重测试方法,包括与现有程序的比较。第三章介绍了新提出的经验贝叶斯/ TPPFP程序,根据一些模拟结果,与现有的多种测试程序相比,该方法被发现功能更强大,保守性更低。接下来的两章说明了多重测试程序在两个生物学数据集上的应用。第4章介绍了一个HIV-1数据集,该数据集由逆转录酶和蛋白酶密码子位置以及复制能力的结果组成。该分析的目的是确定哪些密码子突变与病毒复制单变量相关。第5章介绍了蛋白质组学数据集,该数据集由急性髓细胞性白血病(AML)和急性淋巴细胞性白血病(ALL)患者的质谱数据组成。该分析的目的是正确地预处理光谱数据,然后应用多种测试程序来确定两种白血病亚型之间差异表达的蛋白质。第6章介绍并探讨了一种一般的假设技术。该一般途径测试程序适用于先前描述的HIV-1数据集。此过程将确定特定模型的重要性,该模型代表病毒密码子(蛋白酶和逆转录酶)与复制能力之间的关系。最后,第7章提供了对前几章的总结和总结。

著录项

  • 作者

    Birkner, Merrill Dobbel.;

  • 作者单位

    University of California, Berkeley.;

  • 授予单位 University of California, Berkeley.;
  • 学科 Biology Biostatistics.
  • 学位 Ph.D.
  • 年度 2006
  • 页码 121 p.
  • 总页数 121
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 生物数学方法;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号