...
首页> 外文期刊>BMC Bioinformatics >Learning a hierarchical representation of the yeast transcriptomic machinery using an autoencoder model
【24h】

Learning a hierarchical representation of the yeast transcriptomic machinery using an autoencoder model

机译:使用自动编码器模型学习酵母转录组学机制的层次表示

获取原文
           

摘要

A living cell has a complex, hierarchically organized signaling system that encodes and assimilates diverse environmental and intracellular signals, and it further transmits signals that control cellular responses, including a tightly controlled transcriptional program. An important and yet challenging task in systems biology is to reconstruct cellular signaling system in a data-driven manner. In this study, we investigate the utility of deep hierarchical neural networks in learning and representing the hierarchical organization of yeast transcriptomic machinery. We have designed a sparse autoencoder model consisting of a layer of observed variables and four layers of hidden variables. We applied the model to over a thousand of yeast microarrays to learn the encoding system of yeast transcriptomic machinery. After model selection, we evaluated whether the trained models captured biologically sensible information. We show that the latent variables in the first hidden layer correctly captured the signals of yeast transcription factors (TFs), obtaining a close to one-to-one mapping between latent variables and TFs. We further show that genes regulated by latent variables at higher hidden layers are often involved in a common biological process, and the hierarchical relationships between latent variables conform to existing knowledge. Finally, we show that information captured by the latent variables provide more abstract and concise representations of each microarray, enabling the identification of better separated clusters in comparison to gene-based representation. Contemporary deep hierarchical latent variable models, such as the autoencoder, can be used to partially recover the organization of transcriptomic machinery.
机译:活细胞具有复杂的,层次结构化的信号系统,该系统编码并吸收各种环境和细胞内信号,并且还可以传输控制细胞反应的信号,包括严格控制的转录程序。系统生物学中一项重要而又具有挑战性的任务是以数据驱动的方式重建细胞信号系统。在这项研究中,我们调查了深度层次神经网络在学习和代表酵母转录组学机制的层次组织中的实用性。我们设计了一个稀疏的自动编码器模型,该模型由一层观察变量和四层隐藏变量组成。我们将该模型应用于一千多个酵母微阵列,以了解酵母转录组学机制的编码系统。选择模型后,我们评估了训练有素的模型是否捕获了生物学上的敏感信息。我们显示第一隐藏层中的潜在变量正确捕获了酵母转录因子(TFs)的信号,从而在潜在变量和TF之间获得了接近一对一的映射。我们进一步表明,在较高的隐蔽层中由潜在变量调节的基因通常参与共同的生物过程,并且潜在变量之间的层次关系符合现有知识。最后,我们表明,由潜在变量捕获的信息为每个微阵列提供了更抽象和简洁的表示形式,与基于基因的表示形式相比,能够识别更好的分离簇。当代的深层次潜在变量模型,例如自动编码器,可用于部分恢复转录组机器的组织。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号