...
首页> 外文期刊>SIGKDD explorations >Semi-Supervised Generative Adversarial Network for Gene Expression Inference
【24h】

Semi-Supervised Generative Adversarial Network for Gene Expression Inference

机译:基因表达推理的半监督生成对抗网络

获取原文
获取原文并翻译 | 示例
           

摘要

Gene expression profiling provides comprehensive characterization of cellular states under different experimental conditions, thus contributes to the prosperity of many fields of biomedical research. Although the rapid development of gene expression profiling has been observed, genome-wide profiling of large libraries is still expensive and difficult. Due to the fact that there are significant correlations between gene expression patterns, previous studies introduced regression models for predicting the target gene expressions from the landmark gene profiles. These models formulate the gene expression inference in a completely supervised manner, which require a large labeled dataset (i.e. paired landmark and target gene expressions). However, collecting the whole gene expressions is much more expensive than the landmark genes. In order to address this issue and take advantage of cheap unlabeled data (i.e. landmark genes), we propose a novel semi-supervised deep generative model for target gene expression inference. Our model is based on the generative adversarial network (GAN) to approximate the joint distribution of landmark and target genes, and an inference network to learn the conditional distribution of target genes given the landmark genes. We employ the reliable generated data by our GAN model as the extra training pairs to improve the training of our inference model, and utilize the trustworthy predictions of the inference network to enhance the adversarial training of our GAN network. We evaluate our model on the prediction of two types of gene expression data and identify obvious advantage over the counterparts.
机译:基因表达分析在不同的实验条件下提供了细胞状态的综合表征,从而有助于许多生物医学研究领域的繁荣。虽然已经观察到基因表达分析的快速发展,但大型文库的基因组曲线仍然昂贵且困难。由于基因表达模式之间存在显着相关性,先前的研究引入了预测地标基因谱预测靶基因表达的回归模型。这些模型以完全监督的方式配制基因表达推理,其需要大标记的数据集(即配对的地标和靶基因表达)。然而,收集全基因表达比地标基因更昂贵。为了解决这个问题并利用廉价的未标记数据(即载体基因),我们提出了一种新型半监督的靶基因表达推理模型。我们的模型基于生成的对抗网络(GAN),以近似地标和靶基因的关节分布,以及推导网络以鉴于地标基因的靶基因的条件分布。我们通过我们的GaN模型采用可靠的生成数据作为额外的培训对,以改善推理模型的培训,利用推理网络的值得信赖的预测来增强我们的GAN网络的对抗培训。我们在预测两种类型的基因表达数据上评估我们的模型,并通过对应物来确定明显的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号