...
首页> 外文期刊>Journal of computational biology: A journal of computational molecular cell biology >Identifying Cell Subpopulations and Their Genetic Drivers from Single-Cell RNA-Seq Data Using a Biclustering Approach
【24h】

Identifying Cell Subpopulations and Their Genetic Drivers from Single-Cell RNA-Seq Data Using a Biclustering Approach

机译:使用双细胞RNA-SEQ数据识别细胞群及其遗传驱动因素

获取原文
获取原文并翻译 | 示例
           

摘要

Single-cell RNA-Seq (scRNA-Seq) has attracted much attention recently because it allows unprecedented resolution into cellular activity; the technology, therefore, has been widely applied in studying cell heterogeneity such as the heterogeneity among embryonic cells at varied developmental stages or cells of different cancer types or subtypes. Apertinent question in such analyses is to identify cell subpopulations as well as their associated genetic drivers. Consequently, a multitude of approaches have been developed for clustering or biclustering analysis of scRNA-Seq data. In this article, we present a fast and simple iterative biclustering approach called "BiSNN-Walk'' based on the existing SNN-Cliq algorithm. One of BiSNN-Walk's differentiating features is that it returns a ranked list of clusters, which may serve as an indicator of a cluster's reliability. Another important feature is that BiSNN-Walk ranks genes in a gene cluster according to their level of affiliation to the associated cell cluster, making the result more biologically interpretable. We also introduce an entropy-based measure for choosing a highly clusterable similarity matrix as our starting point among a wide selection to facilitate the efficient operation of our algorithm. We applied BiSNN-Walk to three large scRNA-Seq studies, where we demonstrated that BiSNN-Walk was able to retain and sometimes improve the cell clustering ability of SNN-Cliq. We were able to obtain biologically sensible gene clusters in terms of GO term enrichment. In addition, we saw that there was significant overlap in top characteristic genes for clusters corresponding to similar cell states, further demonstrating the fidelity of our gene clusters.
机译:单细胞RNA-SEQ(ScRNA-SEQ)最近引起了很多关注,因为它允许前所未有的分辨率进入细胞活动;因此,该技术已被广泛应用于研究细胞异质性,例如在不同癌症类型或亚型的各种发育阶段或细胞中的胚胎细胞之间的异质性。这种分析中的内部问题是识别细胞群以及相关的遗传司机。因此,已经开发了众多方法用于对ScrNA-SEQ数据进行聚类或双层分析。在本文中,我们提出了一种基于现有SNN-CLIQ算法的快速和简单的迭代双板方法,称为“Bisnn-Walk”。Bisnn-Walk的区分功能之一是它返回一个排名的群集列表,可以作为群集可靠性的指标。另一个重要特征是,根据其与相关细胞聚类的隶属度,Bisnn-Walk在基因簇中排名基因,使得结果更加生物学解释。我们还引入了基于熵的选择措施一种高度聚焦的相似性矩阵作为我们在广泛选择中的起点,以促进我们算法的有效运行。我们将Bisnn-Walk播放到三个大的Scrna-SEQ研究,在那里我们证明了Bisnn-Walk散步能够保留,有时可以保留和有时改善SNN-CLIQ的细胞聚类能力。我们能够在富集的富集方面获得生物学上合理的基因集群。此外,我们看到有重要意义在对应于类似细胞状态的簇的顶部特征基因中重叠,进一步展示了我们基因簇的保真度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号