首页> 外文会议>Chinese Automation Congress >Identification of disease-related genes based on tensor factorization with RNA-seq data of huntington's disease mice
【24h】

Identification of disease-related genes based on tensor factorization with RNA-seq data of huntington's disease mice

机译:基于张量分解和亨廷顿病小鼠RNA-seq数据的疾病相关基因鉴定

获取原文

摘要

In recent years, with the development of next-generation sequencing technology, large amounts of omics data have been generated, making it possible to explore the molecular mechanisms of Huntington's disease by computational methods at a genome-wide scale. Since the pathology mechanisms of the neurodegenerative diseases are complicated, the traditional computational methods cannot effectively identify the most disease-related genes. In this paper, we propose a new approach based on tensor factorization to analyze the RNA-seq data of Huntington's disease (TFR). According to the approach, we also design a new framework to identify the disease-related genes. Firstly, the RNA-seq data are mapped into three low-dimensional spaces by TFR, i.e. the gene space, the sample space and the time space. We assume that the common components obtained by TFR in the three subspace represent the hidden biological signals that affect gene expression. Then, the disease-related biological signals are selected, and a ranked list is obtained by sorting the genes according to the gene expression value shaped by the disease-related biological signals. The ability for extracting dependence structures of the gene expression data makes TFR more robust and efficient to identify disease-related genes. Experimental results on the RNA-seq data of Huntington's disease mice demonstrate that TFR outperforms the traditional methods. It has been shown that TFR improves the identification accuracy of the disease-related genes as well as the precision of the top ranked genes.
机译:近年来,随着下一代测序技术的发展,已经产生了大量的组学数据,从而有可能通过计算方法在全基因组范围内探索亨廷顿氏病的分子机制。由于神经退行性疾病的病理机制复杂,传统的计算方法无法有效地识别与疾病最相关的基因。在本文中,我们提出了一种基于张量分解的新方法来分析亨廷顿舞蹈病(TFR)的RNA-seq数据。根据该方法,我们还设计了一个新框架来鉴定与疾病相关的基因。首先,通过TFR将RNA-seq数据映射到三个低维空间,即基因空间,样品空间和时间空间。我们假设通过TFR在三个子空间中获得的公共成分代表了影响基因表达的隐藏生物信号。然后,选择与疾病相关的生物信号,并根据由疾病相关的生物信号整形的基因表达值对基因进行分类,从而获得排名表。提取基因表达数据的依赖性结构的能力使TFR更加强大和有效地鉴定与疾病相关的基因。对亨廷顿舞蹈病小鼠的RNA-seq数据进行的实验结果表明,TFR优于传统方法。已经表明,TFR提高了疾病相关基因的鉴定准确性以及排名靠前的基因的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号