首页> 外文期刊>BMC Bioinformatics >FastProject: a tool for low-dimensional analysis of single-cell RNA-Seq data
【24h】

FastProject: a tool for low-dimensional analysis of single-cell RNA-Seq data

机译:FastProject:用于单细胞RNA-Seq数据的低维分析的工具

获取原文
           

摘要

Background A key challenge in the emerging field of single-cell RNA-Seq is to characterize phenotypic diversity between cells and visualize this information in an informative manner. A common technique when dealing with high-dimensional data is to project the data to 2 or 3 dimensions for visualization. However, there are a variety of methods to achieve this result and once projected, it can be difficult to ascribe biological significance to the observed features. Additionally, when analyzing single-cell data, the relationship between cells can be obscured by technical confounders such as variable gene capture rates. Results To aid in the analysis and interpretation of single-cell RNA-Seq data, we have developed FastProject, a software tool which analyzes a gene expression matrix and produces a dynamic output report in which two-dimensional projections of the data can be explored. Annotated gene sets (referred to as gene ‘signatures’) are incorporated so that features in the projections can be understood in relation to the biological processes they might represent. FastProject provides a novel method of scoring each cell against a gene signature so as to minimize the effect of missed transcripts as well as a method to rank signature-projection pairings so that meaningful associations can be quickly identified. Additionally, FastProject is written with a modular architecture and designed to serve as a platform for incorporating and comparing new projection methods and gene selection algorithms. Conclusions Here we present FastProject, a software package for two-dimensional visualization of single cell data, which utilizes a plethora of projection methods and provides a way to systematically investigate the biological relevance of these low dimensional representations by incorporating domain knowledge.
机译:背景技术在单细胞RNA-Seq新兴领域中的一个关键挑战是表征细胞之间的表型多样性并以信息方式可视化此信息。处理高维数据时,一种常见的技术是将数据投影到2或3维以进行可视化。但是,有多种方法可以实现此结果,一旦进行预测,就很难将生物学意义归因于观察到的特征。此外,在分析单细胞数据时,技术混杂因素(例如可变的基因捕获率)可能会掩盖细胞之间的关系。结果为了帮助分析和解释单细胞RNA-Seq数据,我们开发了FastProject,这是一种软件工具,可以分析基因表达矩阵并生成动态输出报告,其中可以探索数据的二维投影。合并了带注释的基因集(称为基因“签名”),以便可以理解投影中的特征与其可能代表的生物过程有关。 FastProject提供了一种针对基因签名对每个细胞进行评分的新方法,以最大程度地减少遗漏的转录本的影响,还提供了一种对签名-投影配对进行排名的方法,以便可以快速识别有意义的关联。此外,FastProject是采用模块化体系结构编写的,旨在用作整合和比较新的投影方法和基因选择算法的平台。结论在这里,我们介绍了FastProject,这是一种用于单细胞数据二维可视化的软件包,它利用了大量的投影方法,并提供了一种通过结合领域知识来系统地研究这些低维表示形式的生物学相关性的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号