首页> 外文期刊>BMC Medical Research Methodology >A new statistical method for curve group analysis of longitudinal gene expression data illustrated for breast cancer in the NOWAC postgenome cohort as a proof of principle
【24h】

A new statistical method for curve group analysis of longitudinal gene expression data illustrated for breast cancer in the NOWAC postgenome cohort as a proof of principle

机译:作为NOWAC后基因组研究中乳腺癌纵向基因表达数据曲线组分析的一种新统计方法,作为原理证明

获取原文
获取外文期刊封面目录资料

摘要

Background The understanding of changes in temporal processes related to human carcinogenesis is limited. One approach for prospective functional genomic studies is to compile trajectories of differential expression of genes, based on measurements from many case-control pairs. We propose a new statistical method that does not assume any parametric shape for the gene trajectories. Methods The trajectory of a gene is defined as the curve representing the changes in gene expression levels in the blood as a function of time to cancer diagnosis. In a nested case–control design it consists of differences in gene expression levels between cases and controls. Genes can be grouped into curve groups, each curve group corresponding to genes with a similar development over time. The proposed new statistical approach is based on a set of hypothesis testing that can determine whether or not there is development in gene expression levels over time, and whether this development varies among different strata. Curve group analysis may reveal significant differences in gene expression levels over time among the different strata considered. This new method was applied as a “proof of concept” to breast cancer in the Norwegian Women and Cancer (NOWAC) postgenome cohort, using blood samples collected prospectively that were specifically preserved for transcriptomic analyses (PAX tube). Cohort members diagnosed with invasive breast cancer through 2009 were identified through linkage to the Cancer Registry of Norway, and for each case a random control from the postgenome cohort was also selected, matched by birth year and time of blood sampling, to create a case-control pair. After exclusions, 441 case-control pairs were available for analyses, in which we considered strata of lymph node status at time of diagnosis and time of diagnosis with respect to breast cancer screening visits. Results The development of gene expression levels in the NOWAC postgenome cohort varied in the last years before breast cancer diagnosis, and this development differed by lymph node status and participation in the Norwegian Breast Cancer Screening Program. The differences among the investigated strata appeared larger in the year before breast cancer diagnosis compared to earlier years. Conclusions This approach shows good properties in term of statistical power and type 1 error under minimal assumptions. When applied to a real data set it was able to discriminate between groups of genes with non-linear similar patterns before diagnosis.
机译:背景技术对与人类致癌作用有关的时间过程变化的理解是有限的。前瞻性功能基因组研究的一种方法是根据许多病例对照对的测量结果,编制基因差异表达的轨迹。我们提出了一种新的统计方法,该方法不假定基因轨迹具有任何参数形状。方法基因的轨迹定义为代表血液中基因表达水平随癌症诊断时间变化的曲线。在嵌套的病例对照设计中,它由病例与对照之间基因表达水平的差异组成。可以将基因分为曲线组,每个曲线组对应于随着时间的推移具有相似发展的基因。提议的新统计方法基于一组假设检验,可以确定基因表达水平是否随时间发展,以及这种发展在不同阶层之间是否不同。曲线组分析可能显示出所考虑的不同阶层之间基因表达水平随时间的显着差异。这项新方法被用于挪威妇女与癌症(NOWAC)后基因组研究中的乳腺癌“概念验证”,使用的是专门为转录组分析保存的血液样本(PAX管)。通过与挪威癌症登记处的联系确定了截至2009年被诊断为浸润性乳腺癌的队列成员,并且还针对每种情况从后基因组队列中随机选择一个对照,并与出生年份和血液采样时间相匹配,以创建一个病例-控制对。排除后,有441个病例对照对可用于分析,其中我们考虑了在诊断时和乳腺癌筛查就诊时的淋巴结状态分层。结果NOWAC后基因组队列中基因表达水平的发展在乳腺癌诊断之前的最后几年有所不同,并且这种发展因淋巴结状态和参与挪威乳腺癌筛查计划的不同而不同。与早年相比,在乳腺癌诊断之前的一年中,被调查阶层之间的差异似乎更大。结论在最小假设下,该方法在统计功效和1类错误方面显示出良好的特性。如果将其应用于真实数据集,则可以在诊断之前区分具有非线性相似模式的基因组。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号