...
首页> 外文期刊>plos computational biology >Non-linear archetypal analysis of single-cell RNA-seq data by deep autoencoders
【24h】

Non-linear archetypal analysis of single-cell RNA-seq data by deep autoencoders

机译:通过深度自动编码器对单细胞RNA-seq数据进行非线性原型分析

获取原文
获取原文并翻译 | 示例

摘要

Advances in single-cell RNA sequencing (scRNA-seq) have led to successes in discovering novel cell types and understanding cellular heterogeneity among complex cell populations through cluster analysis. However, cluster analysis is not able to reveal continuous spectrum of states and underlying gene expression programs (GEPs) shared across cell types. We introduce scAAnet, an autoencoder for single-cell non-linear archetypal analysis, to identify GEPs and infer the relative activity of each GEP across cells. We use a count distribution-based loss term to account for the sparsity and overdispersion of the raw count data and add an archetypal constraint to the loss function of scAAnet. We first show that scAAnet outperforms existing methods for archetypal analysis across different metrics through simulations. We then demonstrate the ability of scAAnet to extract biologically meaningful GEPs using publicly available scRNA-seq datasets including a pancreatic islet dataset, a lung idiopathic pulmonary fibrosis dataset and a prefrontal cortex dataset.
机译:单细胞RNA测序(scRNA-seq)的进步使人们成功地发现了新的细胞类型,并通过聚类分析了解了复杂细胞群中的细胞异质性。然而,聚类分析无法揭示跨细胞类型共享的连续状态和潜在基因表达程序 (GEP)。我们引入了scAAnet,一种用于单细胞非线性原型分析的自动编码器,用于识别GEPs并推断每个GEP在细胞中的相对活性。我们使用基于计数分布的损失项来解释原始计数数据的稀疏性和过度分散性,并为scAAnet的损失函数添加原型约束。我们首先通过模拟表明,scAAnet在不同指标的原型分析方面优于现有的方法。然后,我们展示了 scAAnet 使用公开可用的 scRNA-seq 数据集(包括胰岛数据集、肺特发性肺纤维化数据集和前额叶皮层数据集)提取具有生物学意义的 GEP 的能力。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号