首页> 外文期刊>Scientific reports. >Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data
【24h】

Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data

机译:从高斯到幂律的广泛分布频谱出现在RNA-seq数据的随机变化中

获取原文
           

摘要

Gene expression levels exhibit stochastic variations among genetically identical organisms under the same environmental conditions. In many recent transcriptome analyses based on RNA sequencing (RNA-seq), variations in gene expression levels among replicates were assumed to follow a negative binomial distribution, although the physiological basis of this assumption remains unclear. In this study, RNA-seq data were obtained from Arabidopsis thaliana under eight conditions (21–27 replicates), and the characteristics of gene-dependent empirical probability density function (ePDF) profiles of gene expression levels were analyzed. For A. thaliana and Saccharomyces cerevisiae, various types of ePDF of gene expression levels were obtained that were classified as Gaussian, power law-like containing a long tail, or intermediate. These ePDF profiles were well fitted with a Gauss-power mixing distribution function derived from a simple model of a stochastic transcriptional network containing a feedback loop. The fitting function suggested that gene expression levels with long-tailed ePDFs would be strongly influenced by feedback regulation. Furthermore, the features of gene expression levels are correlated with their functions, with the levels of essential genes tending to follow a Gaussian-like ePDF while those of genes encoding nucleic acid-binding proteins and transcription factors exhibit long-tailed ePDF.
机译:在相同的环境条件下,基因表达水平在遗传相同的生物之间表现出随机变化。在许多最近的基于RNA测序(RNA-seq)的转录组分析中,假定重复基因表达水平的变化遵循负二项式分布,尽管这一假设的生理基础尚不清楚。在这项研究中,在八个条件下(21-27个重复)从拟南芥中获得RNA-seq数据,并分析了基因表达水平的基因依赖性经验概率密度函数(ePDF)谱的特征。对于拟南芥和酿酒酵母,获得了各种类型的基因表达水平的ePDF,它们被分类为高斯,幂律样,长尾或中间。这些ePDF配置文件与高斯功率混合分布函数很好地拟合,该函数来自包含反馈环的随机转录网络的简单模型。拟合函数表明,带有长尾ePDF的基因表达水平将受到反馈调节的强烈影响。此外,基因表达水平的特征与其功能相关,必需基因水平倾向于遵循高斯型ePDF,而编码核酸结合蛋白和转录因子的基因则具有长尾ePDF。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号