...
首页> 外文期刊>Journal of computational biology >Structure-Aware Principal Component Analysis for Single-Cell RNA-seq Data
【24h】

Structure-Aware Principal Component Analysis for Single-Cell RNA-seq Data

机译:单细胞RNA序列数据的结构感知主成分分析

获取原文
           

摘要

With the emergence of droplet-based technologies, it has now become possible to profile transcriptomes of several thousands of cells in a day. Although such a large single-cell cohort may favor the discovery of cellular heterogeneity, it also brings new challenges in the prediction of minority cell types. Identification of any minority cell type holds a special significance in knowledge discovery. In the analysis of single-cell expression data, the use of principal component analysis (PCA) is surprisingly frequent for dimension reduction. The principal directions obtained from PCA are usually dominated by the major cell types in the concerned tissue. Thus, it is very likely that using a traditional PCA may endanger the discovery of minority populations. To this end, we propose locality-sensitive PCA (LSPCA), a scalable variant of PCA equipped with structure-aware data sampling at its core. Structure-aware sampling provides PCA with a neutral spread of the data, thereby reducing the bias in its principal directions arising from the redundant samples in a data set. We benchmarked the performance of the proposed method on ten publicly available single-cell expression data sets including one very large annotated data set. Results have been compared with traditional PCA and PCA with random sampling. Clustering results on the annotated data sets also show that LSPCA can detect the minority populations with a higher accuracy.
机译:随着基于液滴的技术的出现,现在已经可以在一天中分析数千个细胞的转录组。尽管如此庞大的单细胞队列可能有利于发现细胞异质性,但在预测少数细胞类型方面也带来了新的挑战。任何少数细胞类型的鉴定在知识发现中都具有特殊的意义。在单细胞表达数据的分析中,使用主成分分析(PCA)令人惊讶地减少了尺寸。从PCA获得的主要方向通常受有关组织中主要细胞类型的支配。因此,很可能使用传统的PCA可能会威胁到少数群体的发现。为此,我们提出了位置敏感型PCA(LSPCA),这是PCA的可扩展变体,其核心具有结构感知数据采样。具有结构意识的采样为PCA提供了中性的数据扩展,从而减少了数据集中冗余采样所引起的主方向偏差。我们在十个可公开获得的单细胞表达数据集(包括一个非常大的注释数据集)上对所提出方法的性能进行了基准测试。将结果与传统PCA和随机抽样的PCA进行了比较。在带注释的数据集上的聚类结果还表明,LSPCA可以更准确地检测少数群体。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号