...
首页> 外文期刊>BMC Bioinformatics >RnaSeqSampleSize: real data based sample size estimation for RNA sequencing
【24h】

RnaSeqSampleSize: real data based sample size estimation for RNA sequencing

机译:RNASEQSAMPLESIZE:RNA测序的真实数据示例大小估计

获取原文

摘要

One of the most important and often neglected components of a successful RNA sequencing (RNA-Seq) experiment is sample size estimation. A few negative binomial model-based methods have been developed to estimate sample size based on the parameters of a single gene. However, thousands of genes are quantified and tested for differential expression simultaneously in RNA-Seq experiments. Thus, additional issues should be carefully addressed, including the false discovery rate for multiple statistic tests, widely distributed read counts and dispersions for different genes. To solve these issues, we developed a sample size and power estimation method named RnaSeqSampleSize, based on the distributions of gene average read counts and dispersions estimated from real RNA-seq data. Datasets from previous, similar experiments such as the Cancer Genome Atlas (TCGA) can be used as a point of reference. Read counts and their dispersions were estimated from the reference’s distribution; using that information, we estimated and summarized the power and sample size. RnaSeqSampleSize is implemented in R language and can be installed from Bioconductor website. A user friendly web graphic interface is provided at http://cqs.mc.vanderbilt.edu/shiny/RnaSeqSampleSize/ . RnaSeqSampleSize provides a convenient and powerful way for power and sample size estimation for an RNAseq experiment. It is also equipped with several unique features, including estimation for interested genes or pathway, power curve visualization, and parameter optimization.
机译:成功RNA测序(RNA-SEQ)实验的最重要且经常被忽略的组分之一是样本量估计。已经开发出一些基于二项式模型的方法来基于单个基因的参数来估计样品大小。然而,在RNA-SEQ实验中同时定量数千个基因并测试差异表达。因此,应仔细解决额外的问题,包括多种统计测试的错误发现率,对于不同基因的广泛分布读数和分散。为了解决这些问题,我们开发了一种名为RNASEQSAMPLESIZE的样本大小和功率估计方法,基于来自真实RNA-SEQ数据估计的基因平均读数和分散体的分布。来自先前的数据集,例如癌症基因组Atlas(TCGA)的类似实验可以用作参考点。从参考分布估计读数,它们的分散量估计;使用该信息,我们估计并总结了电源和样本大小。 RNASEQSAMPLESIZPERS以R语言实现,可以从BIOCOCTER网站安装。 http://cqs.mc.vanderbilt.edu/shiny/rnaseqsamplesize/提供用户友好的网络图形界面。 RNASEQSAMPLESIZE为RNASEQ实验提供了一种方便而强大的方法,可用于电源和示例大小估计。它还配备了几种独特的功能,包括估计感兴趣的基因或通路,电源曲线可视化和参数优化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号