Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization

Lihua Zhang; Shihua Zhang

首页> 外文期刊>Nucleic acids research >Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization

【24h】

Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization

机译：使用矩阵分解从多个相互关联的生物学场景的数据中学习通用模式和特定模式

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

High-throughput biological technologies (e.g. ChIP-seq, RNA-seq and single-cell RNA-seq) rapidly accelerate the accumulation of genome-wide omics data in diverse interrelated biological scenarios (e.g. cells, tissues and conditions). Integration and differential analysis are two common paradigms for exploring and analyzing such data. However, current integrative methods usually ignore the differential part, and typical differential analysis methods either fail to identify combinatorial patterns of difference or require matched dimensions of the data. Here, we propose a flexible framework CSMF to combine them into one paradigm to simultaneously reveal Common and Specific patterns via Matrix Factorization from data generated under interrelated biological scenarios. We demonstrate the effectiveness of CSMF with four representative applications including pairwise ChIP-seq data describing the chromatin modification map between K562 and Huvec cell lines; pairwise RNA-seq data representing the expression profiles of two different cancers; RNA-seq data of three breast cancer subtypes; and single-cell RNA-seq data of human embryonic stem cell differentiation at six time points. Extensive analysis yields novel insights into hidden combinatorial patterns in these multi-modal data. Results demonstrate that CSMF is a powerful tool to uncover common and specific patterns with significant biological implications from data of interrelated biological scenarios.

机译：高通量生物学技术（例如ChIP-seq，RNA-seq和单细胞RNA-seq）在各种相互关联的生物学场景（例如细胞，组织和条件）中迅速加速了全基因组组学数据的积累。集成和差异分析是探索和分析此类数据的两个常见范例。但是，当前的集成方法通常会忽略差异部分，典型的差异分析方法要么无法识别差异的组合模式，要么需要匹配的数据维度。在这里，我们提出了一个灵活的框架CSMF，将它们组合成一个范式，以通过矩阵分解从相关生物场景下生成的数据中同时揭示常见模式和特定模式。我们用四个有代表性的应用展示了CSMF的有效性，其中包括成对的ChIP-seq数据，描述了K562细胞与Huvec细胞系之间的染色质修饰图。成对的RNA-seq数据代表两种不同癌症的表达谱;三种乳腺癌亚型的RNA-seq数据;和人类胚胎干细胞在六个时间点分化的单细胞RNA-seq数据。广泛的分析产生了对这些多模式数据中隐藏的组合模式的新颖见解。结果表明，CSMF是一个强大的工具，可以从相互关联的生物场景数据中发现具有重要生物学意义的常见和特定模式。

著录项

来源
《Nucleic acids research》 |2019年第13期|共12页
作者
Lihua Zhang; Shihua Zhang;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类 AB;
关键词
embryonic stem cellscancercell lineschromatingenesgenometechnologybreast cancerrnadatasets;

机译：胚胎干细胞癌细胞系铬酸盐烯基因组技术乳腺癌基因组;

相似文献

外文文献
中文文献
专利

1. Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization [J] . Zhang Lihua, Zhang Shihua Nucleic Acids Research . 2019,第13期

机译：从矩阵分解的多个相互关联的生物情景数据学习常见和特定模式
2. Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets [J] . Erbe Rossin, Kessler Michael D., Favorov Alexander V, Nucleic Acids Research . 2020,第12期

机译：矩阵分解和转移学习跨多个单小区ATAC-SEQ数据集发现的监管生物学
3. Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets [J] . Rossin Erbe, Michael D Kessler, Alexander V Favorov, Nucleic acids research . 2020,第12期

机译：矩阵分解和转移学习跨多个单小区ATAC-SEQ数据集发现的监管生物学
4. Exploring Common and Distinct Structural Connectivity Patterns Between Schizophrenia and Major Depression via Cluster-Driven Nonnegative Matrix Factorization [C] . Junming Shao, Zhongjing Yu, Peiyan Li, IEEE International Conference on Data Mining . 2017

机译：通过集群驱动的非负矩阵分解探索精神分裂症和重度抑郁之间的共同和不同结构连通性模式
5. The effect of Factor Blocks(TM), a manipulative, on student understanding of greatest common factor (GCF), least common multiple (LCM), and prime factorization (PF). [D] . Getgood, Jacqueline Faillace. 2001

机译：操作性Factor Blocks（TM）对学生理解最大公因子（GCF），最小公倍数（LCM）和素因数分解（PF）的效果。
6. Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization [O] . Lihua Zhang, Shihua Zhang 2019

机译：使用矩阵分解从多个相互关联的生物场景的数据中学习通用模式和特定模式
7. Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization [O] . Lihua Zhang, Shihua Zhang 2019

机译：从矩阵分解的多个相互关联的生物情景数据学习常见和特定模式

Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization

摘要

著录项

相似文献

相关主题

期刊订阅