A fast and efficient count-based matrix factorization method for detecting cell types from single-cell RNAseq data

Shiquan Sun; Yabo Chen; Yang Liu; Xuequn Shang

首页> 外文期刊>BMC Systems Biology >A fast and efficient count-based matrix factorization method for detecting cell types from single-cell RNAseq data

【24h】

A fast and efficient count-based matrix factorization method for detecting cell types from single-cell RNAseq data

机译：一种基于计数的快速高效的矩阵分解方法，可从单细胞RNAseq数据中检测细胞类型

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Single-cell RNA sequencing (scRNAseq) data always involves various unwanted variables, which would be able to mask the true signal to identify cell-types. More efficient way of dealing with this issue is to extract low dimension information from high dimensional gene expression data to represent cell-type structure. In the past two years, several powerful matrix factorization tools were developed for scRNAseq data, such as NMF, ZIFA, pCMF and ZINB-WaVE. But the existing approaches either are unable to directly model the raw count of scRNAseq data or are really time-consuming when handling a large number of cells (e.g. n500). In this paper, we developed a fast and efficient count-based matrix factorization method (single-cell negative binomial matrix factorization, scNBMF) based on the TensorFlow framework to infer the low dimensional structure of cell types. To make our method scalable, we conducted a series of experiments on three public scRNAseq data sets, brain, embryonic stem, and pancreatic islet. The experimental results show that scNBMF is more powerful to detect cell types and 10 - 100 folds faster than the scRNAseq bespoke tools. In this paper, we proposed a fast and efficient count-based matrix factorization method, scNBMF, which is more powerful for detecting cell type purposes. A series of experiments were performed on three public scRNAseq data sets. The results show that scNBMF is a more powerful tool in large-scale scRNAseq data analysis. scNBMF was implemented in R and Python, and the source code are freely available at https://github.com/sqsun .

机译：单细胞RNA测序（scRNAseq）数据始终涉及各种不需要的变量，这些变量将能够掩盖真实信号以识别细胞类型。解决此问题的更有效方法是从高维基因表达数据中提取低维信息以表示细胞类型结构。在过去两年中，针对scRNAseq数据开发了几种强大的矩阵分解工具，例如NMF，ZIFA，pCMF和ZINB-WaVE。但是现有方法要么无法直接对scRNAseq数据的原始计数进行建模，要么在处理大量细胞（例如n> 500）时确实非常耗时。在本文中，我们基于TensorFlow框架开发了一种快速高效的基于计数的矩阵分解方法（单细胞负二项式矩阵分解），以推断细胞类型的低维结构。为了使我们的方法具有可扩展性，我们对三个公共scRNAseq数据集（大脑，胚胎干和胰岛）进行了一系列实验。实验结果表明，scNBMF比scRNAseq定制工具具有更强大的检测细胞类型的能力，并且快10-100倍。在本文中，我们提出了一种快速有效的基于计数的矩阵分解方法scNBMF，它对于检测单元格类型的目的更为强大。对三个公开的scRNAseq数据集进行了一系列实验。结果表明，scNBMF是大规模scRNAseq数据分析中更强大的工具。 scNBMF是用R和Python实现的，其源代码可从https://github.com/sqsun免费获得。

著录项

来源
《BMC Systems Biology》 |2019年第2期|共8页
作者
Shiquan Sun; Yabo Chen; Yang Liu; Xuequn Shang;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类生物科学;
关键词
Single-cell RNA sequencingMatrix factorizationRead countDeep learning;

机译：单细胞RNA测序矩阵分解阅读计数深度学习;

相似文献

外文文献
中文文献
专利

1. Mosaic autosomal aneuploidies are detectable from single-cell RNAseq data [J] . Jonathan A. Griffiths, Antonio Scialdone, John C. Marioni BMC Genomics . 2017,第1期

机译：马赛克常染色体非整倍性可从单细胞RNAseq数据中检测到
2. Fast Pathogen Identification Using Single-Cell Matrix-Assisted Laser Desorption/Ionization-Aerosol Time-of-Flight Mass Spectrometry Data and Deep Learning Methods [J] . Papagiannopoulou Christina, Parchen Rene, Rubbens Peter, Analytical chemistry . 2020,第11期

机译：使用单细胞基质辅助激光解吸/电离 - 气溶胶飞行时间质谱数据和深度学习方法的快速病原体鉴定
3. Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets [J] . Erbe Rossin, Kessler Michael D., Favorov Alexander V, Nucleic Acids Research . 2020,第12期

机译：矩阵分解和转移学习跨多个单小区ATAC-SEQ数据集发现的监管生物学
4. A fast and efficient count-based matrix factorization method for detecting cell types from single-cell RNAseq data [C] . Asia Pacific Bioinformatics Conference . 2019

机译：一种快速有效的基于计数的矩阵分解方法，用于检测单小区RNASEQ数据的小区类型
5. Efficient alternating gradient-type algorithms for the approximate non-negative matrix factorization problem. [D] . Gonzalez, Edward F. 2009

机译：用于近似非负矩阵分解问题的高效交替梯度类型算法。
6. Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization [O] . Xun Zhu, Travers Ching, Xinghua Pan, -1

机译：通过非负矩阵分解检测单细胞RNA-Seq数据中的异质性
7. A fast and efficient count-based matrix factorization method for detecting cell types from single-cell RNAseq data [O] . Shiquan Sun, Yabo Chen, Yang Liu, 2019

机译：一种快速高效的基于计数的矩阵分解方法，用于检测单小区RNASEQ数据的小区类型

A fast and efficient count-based matrix factorization method for detecting cell types from single-cell RNAseq data

摘要

著录项

相似文献

相关主题

期刊订阅