首页> 外文期刊>MethodsX >Statistical methods for analysis of single-cell RNA-sequencing data
【24h】

Statistical methods for analysis of single-cell RNA-sequencing data

机译:用于分析单细胞RNA测序数据的统计方法

获取原文
           

摘要

Single-cell RNA-sequencing (scRNA-seq) is a recent high-throughput genomic technology used to study the expression dynamics of genes at single-cell level. Analyzing the scRNA-seq data in presence of biological confounding factors including dropout events is a challenging task. Thus, this article presents a novel statistical approach for various analyses of the scRNA-seq Unique Molecular Identifier (UMI) counts data. The various analyses include modeling and fitting of observed UMI data, cell type detection, estimation of cell capture rates, estimation of gene specific model parameters, estimation of the sample mean and sample variance of the genes,etc. Besides, the developed approach is able to perform differential expression, and other downstream analyses that consider the molecular capture process in scRNA-seq data modeling. Here, the external spike-ins data can also be used in the approach for better results. The unique feature of the method is that it considers the biological process that leads to severe dropout events in modeling the observed UMI counts of genes.? The differential expression analysis of observed scRNA-seq UMI counts data is performed after adjustment for cell capture rates.? The statistical approach performs downstream differential zero inflation analysis, classification of influential genes, and selection of top marker genes.? Cell auxiliaries including cell clusters and other cell variables (e.g., cell cycle, cell phase) are used to remove unwanted variation to perform statistical tests reliably.
机译:单细胞RNA测序(ScRNA-SEQ)是最近的高通量基因组技术,用于研究单细胞水平的基因的表达动态。在存在生物混杂因素的情况下分析SCRNA-SEQ数据,包括辍学事件是一个具有挑战性的任务。因此,本文提出了一种新的统计方法,用于SCRNA-SEQ独特的分子标识符(UMI)计数数据的各种分析。各种分析包括观察到的UMI数据的建模和拟合,细胞类型检测,细胞捕获率的估计,基因特异性模型参数的估计,估计样品平均值和基因的样本方差等。此外,开发方法能够进行差异表达,以及其他下游分析,以考虑SCRNA-SEQ数据建模中的分子捕获过程。这里,外部Spike-Ins数据也可以用于方法以获得更好的结果。该方法的独特特征是它考虑了对模拟观察到的基因计数的严重辍学事件导致的生物过程。观察到的ScRNA-SEQ UMI计数数据的差异表达分析是在调整细胞捕获速率之后进行的。统计方法表现下游差分零充气分析,影响基因的分类,以及顶部标记基因的选择。?包括细胞簇和其他细胞变量(例如,细胞周期,细胞相位)的细胞助剂用于去除不需要的变化以可靠地进行统计测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号